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mmiQnfi yop toexttmg MOSCLE DISEASES AND DISORDERS 

BttcKgrgmd flf tho invOTtlgn 
The Invention relates to prophylactic or affirmative 

S treatsent of diseases and disorders of the nusculature by 
adainistering polypeptides found in vertebrate species^ 
idiidh polypeptides are grovth, differentiation and survival 
factors for auscle cells* 

Mosole tissue in adult vertebrates vill regenerate 

10 froB reserve myoblasts called satellite cells. Satellite 
cells are distributed throughout auscle tissue and are 
mitotically quiescent in the absence of injury or disease. 
Polloving muscle injury or during recovery from disease, 
satellite cells vill reenter the cell cycle, proliferate and 

15 1} enter existing muscle fibers or 2) undergo 

differentiation into multinucleate myotubes vhicAi form nev 
muscle fiber. The ivoblasts ultimately yield r^lacement 
muscle fibers or fuse into existing muscle fibers, thereby 
increasing fiber girth by the synthesis of contractile 

20 apparatus components. This process is illustrated, for 

exaa^le, by the nearly complete regeneration which occurs In 
mammals following induced muscle fiber degeneration; the 
muscle progenitor cells proliferate and fuse together 
regenerating muscle fibers. 

25 Several growth factors which regulate the 

proliferation and differentiation of adult (and embryonic) 
myoblasts in vitoo have been identified. Fibroblast growth 
factor (FGF) is mitogenic for muscle cells and is an 
inhibitor of muscle differentiation. Transforming growth 

30 factor 0 VtQFfi) has no effect on myoblast proliferation, but 
is an inhibitor of muscle differentiation. Insulin-like 
growth factors (IGPs) have been shown to stimulate both 
myoblast proliferation and differentiation in rodents. 
Platelet derived growth factor (PD6P) is also mitogenic for 

35 myoblasts and is a potent inhibitor of muscle cell 
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dlffermtiatlosi see: Florlni and Nagri, 1989:256:0701* 
C711). 

Zn vertebrate species both muscle tissue and 
neurons are potential sources of factors which stlsulate 
5 myoblast proliferation and differentiation. In diseases 
affecting the neuromuscular system iriiich are neural in 
origin (i.e., neurogenic), the muscle tissue innervated 
by the affected nerve becomes paralysed and vastes 
progressively. During periidieral nerve regeneration and 

10 recovery from neurologic and myopathic disease, neurons 
may provide a source of growth factors «diich elicit the 
muscle regeneration described above and provide a 
mechanism for muscle recovery from wasting and atrophy. 

k recently described family of growth factors, the 

15 neuregulins, are synthesised by motor neurons (Marconi 
et al. Baton 362:313, 1993) and inflammatory cells 
(Tarakhovaky et al.. Oncogene 6:2187-2196 (1991)). The 
neuregulins and related pias^'^ binding factors have 
been purified, cloned and expressed (Benveniste et al., 

20 PMAS AZ:3930*3934, 1985; Rimura et al.. Nature 

211:257-260, 1990} Davis and Stroobant, J. cell. Biol, 
lift: 1353-1360, 1990; Hen et al.. Ceil fi2:559, 1992; 
Yarden and Ullrich, Jinn. Rbv. Biochem. 22:^^3, 1988; 
Holmes et al.. Science 2Sfi:i205, 1992; Dobashi et al., 

25 Froc. BetJ. ilcad. 5ci. Aft:8982, 1991; Iavu et al., Proc. 
Bati. Jkcad. Sci. A2:2287, 1992). Recombinant neuregulins 
have been shown to be mitogenic for peripheral glia 
(Karchionni et al., Kature 362:313, 1993) and have been 
shown to influence the formation of the neuromuscular 

30 junction (Palls et al.. Cell 72:801, 1993). Thus the 

regenerating neuron and the inflammatory cells associated 
with the recovery from neurogenic disease and nerve 
injury provide a source of factors which coordinate the 
remyelination of motor neurons and their ability to form 

35 the appropriate connection with their target. After 
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nuscle has been relimervat d th a t r neuron may provid 
factors to suecle, etlaulatlng aiiscle growth and 
survival. 

Currently, there is no useful therapy for the 
5 proBOtion of nuscle differentiation and survival. Such a 
therapy would be useful for treataent of a variety of 
neural and atiscular diseases and disorders. 

Siiimnnry nf thn Tnygntion 
We have discovered that increased mitogenesis 

10 differentiation and survival of auscle cells aay be 
acdiieved using proteins heretofore described as glial 
growth factors, acetylcholine receptor inducing activity 
(ARIA), heregulins, neu differentiation factor, and, acre 
generally, neuregulins. We have discovered that these 

15 compounds are capable of inducing both the proliferation 
of ansole cells and the differentiation and survival of 
nyotubes. Ihese phenomena may occur in cardiac and 
saooth muscle tissues in addition to skeletal auscle 
tissues. Ibus, the above coapounds, regulatory coapounds 

20 trtiich induce synthesis of these coapounds, and saall 
aoleculss %fhi^ aimic these coapounds by binding to the 
reciters on auscle or liy stiaulating through other aeans 
the second aessenger systeas activated by the ligand* 
receptor complex are all extreaely useful as prophylactic 

25 and affiraative therapies for auscle diseases. 

A novel aspect of the Invention involves the use 
of the above naaed proteins as growth factors to induce 
the aitogenesis, survival, growth and differentietion of 
auscle cells. Treating of the muscle cells to achieve 

30 these effects aay be achieved by contacting auscle cells 
with a polypeptide described herein. The treataents aay 
be provided to slow or halt net auscle loss or to 
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increase the aacnmt or quality f ausele pr v nt in the 
vertebrate. 

These factors aay be used to produce muscle cell 
mitogenesis, differentiation, and survival in a 
S vertebrate (preferably a BaBsal, acre preferably a huaan) 
by administering to the vertebrate an effective amount of 
a polypeptide or a related compound. Neuregulin effects 
on muscle may ocouTi for example, by causing an increase 
in muscle performance by inducing the synthesis of 

10 particular isoforms of the contractile apparatus such as 
the myosin heavy chain slow and fast isoforms; by 
promoting muscle fiber survival via the induction of 
synthesis of protective molecules such as, but not 
limited to, dystrophin; and/or by increasing muscle 

15 innervation by, for exa^le, increasing aoetylofaoline 
receptor Mlecules at the neuromuscular junction. 

The term muscle cell as used herein refers to any 
cell which contributes to muscle tissue. Myoblasts, 
satellite cells, myotubes, and myofibril tissues are all 

20 included in the term "muscle cells" and may all be 

treated using the methods of the invention. Muscle cell 
effects may be induced vithln sksletal, cardiac and 
smooth muscles. 

Mitogenesis may be induced in muscle cells, 

25 including myoblasts or satellite cells, of skeletal 

muscle, smooth muscle or cardiac muscle. Mitogenesis as 
used herein refers to any cell division which results in 
the production of new muscle cells in the patient. Ndre 
specifically, mitogenesis in vitro is defined as an 

30 increase in mitotic index relative to untreated cells of 
50%, more preferably lOOt, and most preferably 300%, when 
the cells are ei^osed to labelling agent for a time 
equivalent to two doubling times. The mitotic index is 
the fraction of cells in the culture which have labelled 
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nuclei vhen grown in the presence of a tracer vhidh only 
incorporates during S phase (i.e., BrdU) and the doubling 
tiae is defined as the average tiae required for the 
nuBd»er of cells in the culture to Increase by a factor of 
5 two). 

An effect on aitogenesis in vivo is defined as an 
increase in satellite cell activation as aeasured by the 
appearance of labelled satellite cells in the auscle 
tissue of a aaaaal exposed to a tracer t^ich only 

10 incozporates during 8 phase (i.e., BrdU). In useful 
therapeutic is defined in vivo as a coqpound which 
increases satellite cell activation relative to a control 
aaaaal by at least 10%, acre preferably by at least 50%, 
and aost preferably by acre than 200% vhen the aaaaal is 

15 exposed to labelling agent for a period of greater than 
15 ainutes and tissues are assayed between 10 hours and 
34 hours after adainistration of the altogen at the 
therapeutic dose. Alternatively, satellite cell 
activation in vivo aay be detected by aonitoring the 

20 appearance of the interaedlate fllaaent viaentln by 

iaaunologlcal or MIA analysis aethods. When viaentln is 
assayed, the useful altogen Is defined as one which 
causes expression of detectable levels of viaentln in 
the auscle tissue when the therapeutically useful dosage 

25 is provided. 

Ifyogenesis as used herein refers to any fusion of 
ayoblasts to yield ayotubes. Most preferably, an effect 
on ayogenesls is defined as an Increase in the fusion of 
ayoblasts and the enableaent of the auscle 

30 differentiation prograa. The useful pyogenic therapeutic 
is defined as a coqpound which confers any increase in 
the fusion index in vitro. More preferably, the congpound 
confers at least a 2.0*fold Increase and, aost 
preferably, the coapound confers a 3-fold or greater 



inerMM in the fusion indax relative to the control. 
The fusion index is defined as the fraction of nuclei 
present in aultinucleated cells in the culture relative 
to the total nuBber of nuclei present in the culture. 
5 ^e percentages provided above are for cells assayed 

after 6 days of exposure to the myogenic cospound and are 
relative to an untreated control, Xyogenesis say also be 
deterained by assaying the nusber of nuclei per area in 
ayotubes or by Beasurement of the levels of mscle 

10 specific protein lav Western analysis. Preferably, the 
campomA confers at least a 2.0-fold increase in the 
density of myotubes using the assay provided, for 
example, herein, and, Bost preferably, the coapound 
confers a 3-fold or greater increase. 

15 The grovth of suscle may occur by the increase in 

the fiber sise and/or by increasing the nusber of fibers. 
The grovth of nuscle as used herein Bay be Beasured by A) 
an increase in vet veight, B) an increase in protein 
content, C) an increase in the nuBber of Buscle fibers, 

20 or D) an increase in auscle fiber diaaeter. An increase 
in grovth of a Busele fiber can be defined as an increase 
in the diaaeter vhere the diaaeter is defined as the 
ainor axis of ellipsis of the cross section. The useful 
therapeutic is one vhich increases the vet veight, 

25 protein content and/or diaaeter by lot or aore, acre 
preferably by aore than 50% and aost preferably by aore 
than 100% in an aniaal vhose auscles have been previously 
degenerated by at least 10% and relative to a siailarly 
treated control aniaal (i.e., an aniaal vith degenerated 

30 auscle tissue vhich is not treated vith the auscle grovth 
coapound) . A coapound irtiich increases grovth by 
increasing the nuaber of auscle fibers is useful as a 
therapeutic vhen it increases the nuaber of fibers in the 
diseased tissue by at least 1%, aore preferably at least 
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20%, and Bost pr ferably, by at least 50%. Shaoa 
percentages are detenined relative to the basal level in 
a coBQ>arable untreated undlseased aaanal or in the 
contralateral undiseased suscle vhen the eeopound is 
5 adBlnlstered and acts locally* 

the survival of suscle fibers as used herein 
refers to the prevention of loss of auscle fibers as 
evidenced by necrosis or apoptosis or the prevention of 
other sechanisas of nuscle fiber loss. Survival as iised 

10 herein indicates an decrease in the rate of cell death of 
at least 10%, ttore preferably by at least 50%, and Bost 
preferably by at least 300% relative to an untreated 
control. The rate of survival aay be aeasured by 
counting cells stainable vith a dye specific for dead 

15 cells (sucdi as propidius iodide) in culture when the 

cells are 8 days post-differentiation (i«e«, 8 deys after 
the aedia is Changed fros 20% to 0.5% serus). 

Nuscle regeneration as used herein refers to the 
process by vhi^ nev suscle fibers f on f roB auscle 

20 progenitor cells. The useful therapeutic for 

regeneration confers an increase in the nuaber of nev 
fibers by at least 1%, acre preferably by at least 20%, 
and BOst preferably by at least 50%, as defined above. 

The differentiation of auscle cells as used herein 

25 refers to the induction of a auscle developaental prograa 
which specifies the coaponents of the auscle fiber such 
as the contractile apparatus (the ayofibril). The 
therapeutic useful for differentiation increases the 
quantity of any* coq^nent of the auscle fiber in the 

30 diseased tissue by at least 10% or acre, aore preferably 
by 50% or aore, and aost preferably by aore than 100% 
relative to the equivalent tissue in a siailarly treated 
control aniaal. 



Atrophy of auscle as used herein refers to a 
significant loss in muscle fiber girth. By significant 
atrophy is aeant a reduction of auscle fiber dlaaeter in 
diseased, injured or unused muscle tissue of at least 10% 
5 relative to undiseased, uninjured, or normally utilised 
tissue. 

Methods for treatment of diseases or disorders 
using the polypeptides or other compounds described 
herein are also part of the Invention. Bmaaqples of 

10 muscular disorders vhich may be treated include Aieletal 
muscle diseases and disorders such as myopathies, 
dystrophies, ^oneural conductive diseases, traumatic 
muscle injuiry, and nerve injury, cardiac muscle 
pathologies suCh as cardiomyopathies, ischemic damage, 

15 congenital disease, and traumatic injury may also be 

treated using thia methods of the invention, as may smooth 
muscle diseases and disorders such as arterial sclerosis, 
vascular lesions, and congenital vascular diseases. For 
example, Duohennes muscular dystrophy, Beckkers' 

20 dystrophy, and Ifyasthenia gravis are but three of the 
diseases vhich may be treated using the methods of the 
invention. 

The invention also inoludss methods for the 
prophylaxis or treatment of a tumor of muscle cell origin 

25 such as rhabdomyosarcoma. These inethods include 

administration of an effective amount of a substance 
vhich inhibits the binding of one or more of the 
polypeptides described herein and inhibiting the 
proliferation of the cells vhich contribute to the tumor. 

30 The methods of the invention may also be used to 

treat a patient suffering from a disease caused by a lade 
of a neurotropic factor. By ladlcing a neurotrophic 
factor is meant a decreased amoxint of neurotrophic factor 
relative to an unaffected individual sufficient to cause 
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d tftctable decrease in neuronuscular conneetlone and/or 
Buscular strength* The neurotrophic factor may be 
present at levels 10% belov those observed in unaffected 
individuals. More preferably, the factor is present at 
5 levels aot lover than are observed in unaffected 

individuals, and most preferably the levels are lovered 
by 80% relative to unaffected individuals under siailar 
circumstances • 

The nethods of the invention nake use of the fact 

10 that the neuregulin proteins are encoded by the sane 

gene. A variety of messenger RHA splicing variants (and 
their resultant proteins) are derived from this gene and 
many of these products shov binding to PlBSl^'^ and 
activation of the same. Products of this gene have been 

15 used to shov muscle cell mitogenic activity (see Eicamples 
1 and 2 , belov) , differentiation (Examples 3 and 6) , and 
survival (Examples 4 and 5). This invention provides a 
use for all of the knovn products of the neuregulin gene 
(described herein and in the references liiited above) 

20 vhich have the stated activities as muscle cell mitogens, 
differentiation factors i and stirvival factors. Most 
preferably, recombinant human G6P2 (rhG6P2)is used in 
these methods. 

The invention also relates to the use of other, 

25 not yet naturally isolated, splicing variants of the 
neuregulin gene. Fig. 29 shovs the knovn patterns of 
splicing. These patterns are derived fr<MB polymerase 
chain reaction experiments (on reverse transcribed RNA) , 
analysis of cONA clones (as presented vithin) , and 

30 analysis of published sequences encoding neuregulins 
(Poles et al.. Cell £2:205 (1992) and Hen et al.. Cell 
£2:359 (1992)). These patterns, as veil as additional 
patterns disclosed herein, represent probable splicing 
variants vhich exist. The splicing variants are fully 
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described in Goodearl et al., U8SN 08/036,555, filed 
March 24, 1993, incorporated herein by reference* 

More specifically, cell division, survival, 
differentiation and grovth of auscle cells aay be 
5 achieved by contacting muscle cells with a polypeptide 
defined by the f waula 



vherein HBASCX is coqposed of the polypeptide 
segnents shorn in Fig. 30 (SBQ ID Nos, 132, 134, 135, 

10 137*139, 156); vherein W coaqprises the polypeptide 
segaent F, or is absent; vherein Z coaprises the 
polypeptide segaent 6 or is absent; and wherein X 
coaprises the polyp^ide segaent C/D BRL, C/D R, C/D HL, 
C/D D, C/O' HL, C/0' BKL, B, C/0' D, C/0 C/D' HKL, 

15 C/D C/D' H, C/D C/D' HL, C/D C/D' D, C/D D' H, C/D D' HL, 
C/D D' RXL, C/D' D' H, C/0' D' HL, C/D' D' HKL, C/D C/D' 
D' H, C/D C/D' D' HL, or C/D C/D' 0' HKL and/or by 
contacting auscle cells vith a polypeptide defined by the 
f oraula 

20 YBkZCK 

vherein YBAZCX is coaposed of the polypeptide 
segaents shovn in Fig. 30 (8EQ ZD Nos. 133-135, 156, 
159) ; vherein Y convrises the polypeptide segaent E, or 
is absent; vherein Z coaprises the polypeptide segaent 6 

25 or is absent; and vherein X coaprises the polypeptide 

segaent C/D HXL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, 
C/D' B, C/D' D, C/D C/D' BKL, C/0 C/D' B, C/D C/D' BL, 
C/D C/D' D, C/D D' B, C/D D' BL, C/D D' BKL, C/D' D' B, 
C/D' 0' BL, C/D' D' BKL, C/D C/D' 0' B, C/D C/D' D' BL, 

30 or C/D C/D' D' HKL. 

Generally, the H^-terainus of the above-^described 
polypeptides begins vith either the F or B polypeptide 
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segments. When the F p lyp ptide Is present It nay be 
cleaved upon maturation of the protein to yield the 
mature polypeptide* When the E sequence is present the 
first 50 amino acids vhioh represent the N-»terminal 
5 signal seguence may be absent from the polypeptides. 

Furthermore, the invention includes a method of 
treating muscle cells by the application to the muscle 
cell of a 

->30 kO polypeptide factor isolated from the NDA-MB 
10 231 human breast cell line; or 

-35 KD polypeptide factor isolated from the rat I- 
B7 transformed fibroblast cell line to the glial cell or 

*75 kD polypeptide factor isolated from the 
human breast cell line; or 
15 -44 kD polypeptide factor isolated from the rat I- 

BJ transformed fibroblast cell line; or 

-25 kD polypeptide factor isolated from activated 
mouse peritoneal macrophages; or 

-45 kD polypeptide factor isolated from the MDA - 
20 NB 231 human breast cell; or 

-7 to 14 kD polypeptide factor isolated from the 
ATL-2 human T-cell line to the glial cell; or 

-25 kO polypeptide factor isolated from the bovine 
kidney cells; or 
25 -42 kD ARIA polypeptide factor isolated from 

brain; -46-47 kD polypeptide factor which stimulates 

0-2A glial progenitor cells; or 

-43-45 kD polypeptide factor, G6FZII,175 
U.S. patent application Serial No. 07/931,041, filed 
30 August 17, 1992, incorporated herein by reference. 



The invention further includes methods for the use 
of the B6FL1, B6FL2, BGFL3, E6FL4, EGFL5, and BGFL6 
polypeptides. Fig. 37 to 42 and SEQ ID Hob. 150 to 155, 
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r speetivaly, f r th treataent of vusci cells in vivo 
and in YitW» 

Xloo included in the invention is the 
administration of the G6F2 polypeptide whose sequence is 
5 shorn in Pig. 44 for the treatment of suscle colls. 

An additional i^rtant aspect of tho invention 
are aethods for treating suscle cells using: 

(a) a basic polypeptide factor also known to have 
glial cell sitogenic aetivityr in the presence of fetal 
10 calf plasaa, a solecular weight of fron about 30 kD to 
about 36 kD, and including within its asino acid sequence 
any one or sore of the following peptide sequences: 





P 


X 


e 


0 


A 


B 


T 


B 




(8BQ ID NO: 1) 




A 


8 


L 


A 


D 


B 


Y 


B 


Y M X X 


(8BQ ID HO: 2) 
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T 


B 


T 


8 


S 


8 


6 


L 


X L K 


(SEQ ID MO: 3) 




A 


S 


L 


A 


D 


B 


Y 


B 
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(8EQ ID NO: 7) 
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E 


X 


A 
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(8BQ ID MO: 11) 
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(SEQ ID N0il3) 
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(8BQ ID MO! 14) 
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X 






(8BQ ID MO: 15) 
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(SBQ ID MO: 165) 
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(SEQ ID NO: 17) 
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X 


X 


F 


Y 


V 


F 






(SEQ ID MO: 19) 


25 


X 


L 


B 


F 


L 


X 


A 


X 


(8BB ID NOt 32); 


and 



(b) a basic polypeptide factor for use in 
treating suscle cells which is also known to stiaulate 
glial cell sitogenesis in the presence of fetal calf 
plassai has a solecular weight of fros about 55 kD to 
30 about 63 kD, and including within its asino acid sequence 
any one or sore of the following peptide sequences: 
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(SEQ ZD NO: 33) 
(SEQ ZD NO: 34) 
(SEQ ZD NO: 35) 
(SBQ ZD NO: 36) 
(8BQ ZD NO: 37) 
(SEQ ZD NO: 38) 
(SEQ ZD NO: 48) 
(SBQ ZD NO: 49) 
(SEQ ZD NO: 39) 

10 Mathods for the use of the peptide sequences set 

out above, derived fron the saaller solecular velgfat 
polypeptide factor, and from the larger molecular weight 
polypeptide factor, are also aspects of this invention. 
Monoclonal antibodies to the above peptides are 

15 theuelves useful investigative tools and therapeutics. 

Thus, the invention further enteaces aiethods of 
using a polypeptide factor having activities useful for 
treating suscle cells and including an anino acid 
sequence encoded by: 

20 (a) a DNA sequence shorn in any one of Figs. 27A, 

27B or 27C, SEQ ZD Nos. 129*131, respectively; 

(b) a ma sequence shovn in Pig. 31, SBQ ZD No. 

85; 

(c) the DMA sequence represented by nucleotides 
25 281-^557 of the sequence shorn in Fig. 27A, SEQ ZD No. 

129; or 

(d) a DNA sequence hybridizable to any one of the 
DNA sequences according to (a), (b) or (c). 

Following factors as vuscle cell adtogens: 
30 (a) a basic polypeptide factor which has, if 

obtained fron bovine pituitary naterial, an observed 
molecular weight, whether in reducing conditions or not, 
of from about 30kD to about 36kD on SDS-polyacrylaaide 
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gel alectrophoresio vhich factor has susele cell 
aitogenle activity Including atlsulatlng the division o£ 
syohlastSi and when Isolated using reversed*phase HPLC 
retains at least 50% of said activity after 10 veeks 
5 Incubation In 0*1% trlfluoroacetlc acid at 4*0; and 
(b) a basic polypeptide factor which has, If 
obtained fros bovine pituitary uterlal, an observed 
Bolecular vel^t, under non-reduclng conditions, of tram 
about 55 kD to about 63 Xd on SDS-polyacrylaalde gel 

10 electrophoresis vhlch factor the huaan equivalent of 
vhlch Is encoded by Wh clone 66F2HBS5 and vhlch factor 
has Buscle cell aitogenle activity and vhen Isolated 
using reversed«»phase HPLC retains at least 50% of the 
activity after 4 days Incubation in 0.1% trlfluoroacetlc 

15 add at 4*C. 

Thus other loportant aspects of the Invention are 
the use of: 

(a) A series of huaan and bovine polypeptide 
factors having cell aitogenle activity Including 

20 stlaulatlng the division of auscle cells. Ihese peptide 
sequences are shom In Pigs. 30, 31, 32 and 33, SBQ ZD 
Nos. 132--133, respectively. 

(b) A series of polypeptide factors having cell 
aitogenle activity including stlaulatlng the division of 

25 auscle cells and purified and characterised according to 
the procedures outlined by Lupu et al. Science 1552 
(1990); Lupu et al. Proc. Natl. Acad. Sol USA fl2: 2287 
(1992); Holaes et al. Science iSfi: 1205 (1992); Poles et 
al. £21 205 (1992); Yarden and Pales Blocheaistry 

30 3543 (1991); Oobashi et al. Proc. Natl. Acad. Scl. flfi: 
8582 (1991); Davis et al. Biochea. Blophys. Res. Coaaun. 
179 g 1536 (1991); Beauaont.et al., patent application 
PCT/n89l/03443 (1990); Bottensteln, D.8. Patent No. 



PCrAJS94/050a3 



- 15 - 

5,276,145, i88u d 1/4/94; and Green et al« patent 
appllcat^lon PCT/nS91/02331 (1990) . 

(c) A polypeptide factor (G6FBPP5) having glial 
cell aitogenic activity including stiaulating the 
5 division of musole oelle. The amino acid sequence ie 
ehovn in Fig. 31, 8BQ ZD No. 144. 

Methods for stimulating aitogenesis of a myoblast 
by contacting the myoblast cell with a polypeptide 
defined above as a muscle cell mitogen in vivo or in 
10 vitro are included as features of the invention. 

muscle cell treatments nay also be achieved by 
administering DHA encoding the polypeptide compounds 
described above in an expressible genetic construction. 
DNA encoding the polypeptide nay be administered to the 
15 patient using techniques Known in the art for delivering 
DMA to the cells. Per example, retroviral vectors, 
electroporation or liposomes may be used to deliver OKA. 

The invention includes the use of the above named 
family of proteins as extracted from natural sources 
20 (tissues or cell lines) or as prepared by recombinant 
means. 

other coaq^unds in particular, peptides, which 
bind specifically to the pl85^^^ receptor can also be 
used according to the invention as muscle cell mitogens. 

25 A candidate co^ound can be routinely screened for 

pl85^^ binding, and, if it binds, can then be screened 
for glial cell mitogenic activity using the methods 
described herein. 

The invention includes use of any modifications or 

30 equivalents of the above polypeptide factors vhieh do not 
eadiibit a significantly reduced activity. For exaa^le, 
modifications in which amino acid content or sequence is 
altered without substantially adversely affecting 
activity are included. The statements of effect and use 
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contained herein are therefore to be construed 
accordingly, with euch usee and effects esploying 
modified or equivalent factors being part of the 
invention. 

5 The huaan peptide sequences described above and 

presented in Figs. 30, 31, 32 and 33, 8BQ ID Nos. 132- 
146, respectively, represent a series of splicing 
variants vhich can be isolated as full length 
coi^lementary nns (cMAS) fros natural sources (cDHA 
10 libraries prepared froa the appropriate tissues) or can 
be assembled as DHA constructs with individual axons 
(e.g., derived as separate exons) by someone skilled in 
the art. 

The invention also includes a method of making a 
15 medicament for treating muscle cells, .i*o*# for inducing 
muscular mitogenesis, myogenesis, differentiaticm, or 
survival, by administering an effective amount of a 
polypeptide as defined above. Such a medicament is made 
by administering the polypeptide with a i^raaceutically 
20 effective carrier. 

another aspect of the invention is the use of a 
pharmaceutical or veterinary formulation comprising any 
factor as defined above formulated for pharmaceutical or 
veterinary use, respectively, optionally together vith an 
25 acceptable diluent, carrier or excipient and/or in unit 
dosage form. In using the factors of the invention, 
conventional j^rmaoeutioal or veterinary practice may be 
employed to provide suitable formulations or 
conpositions. 

30 Thus, the formulations to be used as a part of the 

invention can be applied to parenteral administration, 
for example, intravenous, subcutaneous, intramuscular, 
intraorbital, oidithalmie, intraventricular, intracranial, 
intracapsular, intraspinal, intracistemal. 
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Intraparlt neal, topical « intranasal, a r sol, 
scarification, and also oral, buccal, roctal or vaginal 
adninistration. 

T9ia foraulations of this invention may also be 
5 adainistered by the transplantation into the patient of 
host cells eacpressing the DMA encoding polyp^tides idiieh 
are effective for the sethods of the invention or by the 
use of surgical iq^lants vhich release the f onulations 
of the invention. 

10 Parenteral f onnilations say be in the f on of 

liquid solutions or suspensions; for oral adninistration, 
fonmlations nay be in the fon of tablets or capsiiles; 
ar^ for intranasal forsulations, in the fora of powders, 
nasal drops, or aerosols* 

15 Methods veil )cnovn in the art for saJcing 

formulations are to be found in, for example, 
"Remington's Pharaaeetttical sciences." Ponnilations for 
parenteral administration may, for example, cmtain as 
excipients sterile vater or saline, polyalkylene glycols 

20 such as polyethylene glycol, oils of vegetable origin, or 
hydrogenated naphthalenes, biocompatible, biodegradable 
lactide polymer, or polyoxyethylene-*polyoxypropylene 
copolymers may be used to control the release of the 
present factors* Other potentially useful parenteral 

25 delivery systems for the factors include ethylene-vinyl 
acetate copolymer particles, osmotic pumps, implantable 
infusion systems, and liposomes. Formulations for 
ihhalation may contain as excipients, for sjcample, 
lactose, or aay be aqueous solutions containing, for 

30 example, polyoxyethylene*9-lauryl ether, glycoeholate and 
deoxycholate, or aay be oily solutions for administration 
in the form of nasal drops, or as a gel to be applied 
intranasally. . Pomulations for parenteral administration 
may also include glycoeholate for buccal administration. 
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Bethoxysalicylate for rectal adainistration, or citric 
acid for vaginal adninistration. 

The present factors can be used as the sole active 
agents, or can be used in coabination with other active 
5 ingredients, e.g., other grovth factors which could 

facilitate neuronal survival in neurological diseases, or 
peptidase or protease inhibitors. 

The conoentration of the present factors in the 
formulations of the invention vill vary depending upon a 
10 nuBber of issues, including the dosage to be 
administered, and the route of administration. 

In general terms, the factors of this invention 
may be provided in an aqueous physiological buffer 
solution containing about 0.1 to lOt v/v compound for 
15 parenteral administration. General dose ranges are from 
about 1 mg/kg to about 1 g/kg of body weight per day; a 
preferred dose rai^e is from about 0.01 mg/kg to 100 
mg/kg of body veight per day. The preferred dosage to be 
administered is likely to depend upon the type and extent 
20 of progression of the patho^ysiological condition being 
addressed, the overall health of the patient, the make up 
of the formulation, and the route of administration. 

'Oxm polypeptide factors utilized in the methods of 
the invention can also be used as immunogens for making 
25 antibodies, such as monoclonal antibodies, folloving 
standard techniques. These antibodies can, in turn, be 
used for therapeutic or diagnostic purposes. Thus, 
conditions perhaps associated with muscle diseases 
resulting from abnormal levels of the factor may be 
30 tracked by using such antibodies. 2n vi^i^Q techniques 
can be used, employing assays on isolated saaples using 
standard methods. Imaging methods in vhich the 
antibodies are, for example, tagged with radioactive 
isotopes vhich can be imaged outside the body using 
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t ehnlques f r th art of tumor laaging nay also be 
aqployed. 

A further general aspect of the Invention is the 
use of a factor of the invention in the aanuf acture of a 
5 nedicasent, preferably for the treatnent of a suscular 
disease or disorder. The "GGF2" designation is used for 
all clones vfalch vere previously isolated with peptide 
sequence data derived from G6F*II protein (i.e., 
6GP2HBS5, 66P2BPP3) and, vhen present alone (i.e., 66P2 

10 or rh66F2), to indicate reconbinant huaan protein encoded 
by plasBids isolated vitb peptide sequence data derived 
froB the G6F-ZZ protein (i.e., as produced in insect 
cells froB the plasaid BBSS) . Reconbinant human GGF from 
the GGFHB85 done is called G6F2, rhG6F2 and G6F2BBS5 

15 polypeptide. 

Treating as used herein means any administration 
of the coBpounds described herein for the purpose of 
increasing muscle cell mitogenesis, survival, and/or 
differentiation, and/inr decreasing muscle atrophy and 

20 degeneration. Most preferably, the treating is for the 
purpose of reducing or diminishing the symptoms or 
progression of a disease or disorder of the muscle cells. 
Treating as used herein also means the administration of 
the compounds for increasing or altering the muscle cells 

25 in healthy individuals. The treating may be brought 
about by the contacing of the muscle cells which are 
sensitive or responsive to the coq>ounds described herein 
with an effective amount of the compound, as described 
above. Inhibitors of the covounds described herein may 

30 also be used to halt or slow diseases of muscle cell 
proliferation. 
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Briaf Daaeription of tha Dravinaa 
The drawings vill first ba describad. 
Dravinqa 

Fig. 1 is a graph shoving tha rasulta af rhG6F2 in 
5 a myoblast mitoganasis assay. 

Fig. 2 is a graph shoving tha affact of rhfiGF2 on 
tha nu&bar of nuclai in ayotubas. 

Fig. 3 is a graph of a survival assay shoving tha 
affaet of rh66F2 on survival of diffaraatiatad syotubas. 
10 Fig, 4 is a graph of survival assays shoving tha 

af f act of rhG6F2 on dif f arantiatad ayotubas ralativa to 
huaan platalat darivad grovth factor, huaan fibroblast 
grovth factor, huaan apidanal grovth factor, huaan 
laueocyta inhibitory factor, and huaan insulin-lika 
15 grovth faetora X and ZZ. 

Fig. 5 la a graph shoving tha incraaaad survival 
on Dttohanna auacular dyatrophy oalla in tha prasanca of 
rhG6F2. Fig. 6 ia a graph of ineraasing huaan growth 
homona (h6H) axprassion in C2 calls froa an h<SB raportar 
20 gana undar control of tha AdiR dalta subunit 

transcriptional control alaaanta. Thia incraaaa is tiad 
to tha addition of G6F2 to tha aadia. 

Fig. 7 is a graph of ineraasing hGH raportar 
synthasis and bungarotoxin (BTX) binding to AehRs 
25 following tha addition of ineraasing aaounts of G6F2 to 
C2 calla. 

Figa* 8^ 9, 10 and 11 ara tha paptlda aaquancas 
darivad froa GGF-Z and GOF-ZZ, SEQ ZO Mos. 1-20, 22-29, 
32-50 and 165, (saa Exaaplas 11-13 harainaf ter) . 
30 Fig. 9, Panal A, is tha saquaneas of GGF-Z 

paptidas uaad to dasign daganarata oligonuclaotida probas 
and daganarata PGR priaars ara listad (SEQ ZD Nos. 1, 17 
and 22-29) • Soaa of the sequences in Panel A vere also 
lisad to daaign synthetic paptidas. Panel B is a listing 
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of the sequences of novel peptid s that were too short 
(less than 6 aalno acids) for the design of degenerate 
probes or degenerate PCR primers (SEQ ID Nos. 17 and 32) ; 

Pig. 11, Panel A, ia a listing of the sequences of 

5 GGP-IX peptides used to design degenerate oligonucleotide 
prdbes and degenerate PCR priners (SEQ ID Nos. 42-49) . 
seme of the sequences in Panel A vere used to design 
synthetic peptides. Panel B is a listing of the novel 
peptide that vas too short (less than 6 anlne acids) for 

10 the design of degenerate probes or degenerate PCR priaers 
(8IQ ZD He. SO) ; 

Pigs. 12, 13A, 13B, 14, IS, 16, 17, 18, and 19 
relate to Bxaaqple 8, belov, and depict the aitogenic 
activity of factors of the invention; 

15 Pigs. 20, 21, 22, 23, 24, 25, 26, and 27 relate to 

Eica^le 10, belov and are briefly described below: 

Pig. 20 is a listing of the degenerate 
oligonucleotide probes (SBQ ID Mos. 51-84) designed from 
the novel peptide sequences in Pigure 7, Panel A and 

20 Pigure 9, Panel A; 

Pig. 21 (8BQ ID Ho. 85) depicts a stretch of the 
putative bovine G6P-IZ gene sequence fron the reccabinant 
bovine genoaic ^ge G6P2B61, containing the binding site 
of degenerato oligonucleotide probes 609 and 650 (see 

25 Pigure 18, SEQ ID HQs. 66 and 69, respectively). The 
figure is the coding strand of the DHA sequence and the 
deduced aaino acid sequence in the third reading fraae. 
The sequence of peptide 12 froa factor 2 (bold) is part 
of a 66 aaino acid open reading fraae (nucleotides 

30 75272); 

Pig. 22 is the degenerate PCR priaers (Panel A, 
SEQ IS Hos. 86-104) and unique PCR primers (Panel B, SEQ 
ID Nos. 105-115) used in experiments to isolate segaents 
of the bovine 6CP-II coding sequences present in RHA froa 
35 posterior pituitary; 
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Pig. 23 depicts of tha nine distinct contiguous 
bovine 66F-ZZ cDNA structures and sequences that were 
obtained in PGR amplification esqperiaents. The top line 
of the Figure is a sdieaatic of the coding sequences 
5 vhich contribute to the CDNA structures that vere 
characterised; 

Pig. 34 is a physical up of bovine recoabinant 
phage of GGP2B61. The bovine fragaent is roughly 20 hb 
in length and contains two axons (bold) of the bovine 

10 G6P-ZZ gene. lUMtrietion sites for the ensyaes Xbal, 
Spel, Mdel, BcoRZ, Kpnl, and SstI have been placed on 
this physical aap. Shaded portions correspond to 
fragments which vere subcloned for sequencing; 

Pig. 25 is a seheaatic of the structure of three 

15 alternative gene products of the putative bovine G6P--ZZ 
gene. Bxons are listed A through B in the order of their 
discovery. Vhe alternative splicing patterns 1, 2 and 3 
generate three overlapping deduced protein structures 
(GGP2BPP1, 2, and 3), vhich are displayed in the various 

20 Figs. 27A, 27B, 27C (described belov) ; 

Fig. 26 (SBQ ZD Mos. 116-128) is a ceaqparison of 
the GGF*Z and GGF-ZZ sequences identified in the deduced 
protein sequences shown in Figs. 27A, 27B, 27C (described 
belov) vith the novel peptide sequences listed in Figs. 9 

25 and 11. The Figure shovs that six of the nine novel 
G6F-ZZ peptide sequences are accounted for in these 
deduced protein sequences. Tvo peptide sequences similar 
to 66F-1 sequences are also found; 

Fig. 27 (8BQ ZD No. 129) is a listing of the 

30 coding strand DNA sequence and deduced amino acid 
sequence of the eDHA obtained from splicing pattern 
number 1 in Figure 25. This partial cDNA of the putative 
bovine GGF-IZ gene encodes a protein of 206 amino acids 
in length. Peptides in bold vere those identified from 
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th lists presented in Pigs. 9 and II. P tential 
glycosylation sites are underlined (along with 
polyadenylation signal AATAAA); 

Fig. 27 (SBQ ZD No. 130) is a listing of the 
5 ooding strand DWi sequenee and deduced aaino acid 
sequence of the tsmh obtained froa splicing pattern 
nuBber 2 in Fig. 25. This partial cWh of the putative 
bovine 66F*IZ gene encodes a protein of 281 aaino acids 
in length. Peptides in bold are those identified from 

10 the lists presented in Figs. 7 and 9. Potential 
glycosylatimi sites ere underlined (almg vith 
polyadenylation signal MTAAA); 

Fig. 27 (SBQ ZD Ho. 131) is a listing of the 
ooding strand DMA sequence and deduced aaino acid 

15 sequence of the cDKA obtained froa splicing pattern 

nuaber 3 in Fig. 25. This partial cDWi of the putative 
bovine G6F-ZZ gene encodes a protein of 257 aaino acids 
in length. Peptides in bold are those identified frw 
the lists in Figs. 9 and 11. Potential glycosylation 

20 sites are underlined (along vith polyadenylation signal 
AATAM) . 

Fig. 26, vhich relates to Sxaaple 16 hereinafter, 
is an autoradiograa of a cross hybridisation analysis of 
putative bovine GOF-ZZ gene sequences to a variety of 

25 aaaaalian DKHs on a southern blot. The filter contains 
lanes of BeoRZ«digested DKA (5 nq per lane) froa the 
^ecies listed In the Figure. The probe detects a single 
strong band in each DMA saaple, including a four Icilobase 
f ragaent in the bovine DMA as anticipated by the physical 

30 aap in Fig. 24. Bands of relatively ainor intensity are 
observed as veil, vhich could represent related DMA 
sequences. The strong hybridising band froa each of the 
other aaaaalian DMA saaples presuaably r^resents the 
G6F«*ZZ hoaologue of those species. 
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Fig. 29 la a diagraa of rapreaantatlva apl icing 
varlanta. Tha coding aegnanta ara rapraaantad tfy ?, E, 
B, A, 6, C, C/D, C/0', D\ K and L. Tte location 
of tha paptlda aaquaneaa darivad from purlf lad protaln 
5 ara indlcatad by •o^. 

Fig. 30 (SEQ ID N06. 136-143, 156, 157, 169-178) 
la a Hating of tha DMA aaquaneaa and pradictad paptlda 
urnqftmnemm of tha coding aagmanta of G6F. Lina 1 ia a 
Hating of tha pradictad a&ino add aaguancaa of bovina 
10 G6F, llna 3 ia a liatlng of tha naclaotida aaquaneaa of 
bovina 6GF, llna 3 la a liatlng of tha nuclaotlda 
aaquaneaa of huaan 6GF (heragulln) (nuclaotlda baaa 
utchaa ara indlcatad with a vartlcal llna) and llna 4 la 
a Hating of tha pradictad aalno add aaquaneaa of human 
15 GGF/haragulin vhara it dif f ara from tha pradictad bovina 
aaquanca. Coding aagmanta 8, A' and K rapraaant only tha 
bovina aaquaneaa. coding aagmant D' rapraaanta only tha 
human (haragulln) aaquanca. 

Fig. 31 (S8Q ID Ko. 144) la tha pradictad G6F2 
20 amino add aaquanca and nuclaotlda aaquanca of BPP5. Tha 
uppar lina ia tha nuclaotlda aaquanca and tha lowar llna 
la tha pradictad amino acid aaquanca. 

Fig. 32 (SBQ ID No. 149) la tha pradictad amino 
acid aaquanca and nuclaotlda aaquanca of G6F2BPP2. Tha 
25 iippar lina la tha nuclaotlda aaquanca and tha lovar lina 
ia tha pradictad amino add aaquanca. 

Fig. 33 (SBQ ID MO. 146) l8 tha pradictad amino 
add aaquanca and nuclaotlda aaquanca of GGF2BPP4. Tha 
uppar llna ia tha nuelaotlda aaquanca and tha lovar lina 
30 ia thd pradictad amino acid aaquanca. 

Fig. 34 (SBQ ID Noa. 147-149) dapicta tha 
alignmant of two G6F paptlda aaquaneaa (GGF2BPP4 and 
GGF2BPPS) vlth tha human BGF (hEGF) . Aatariaka indlcata 
poaitiona of eonaarvad cyatainaa. 
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Pl9. 35 depicts the 1 v 1 of 66F activity (Schwann 
cell altogenlo assay) and tyrosine phosphorylation of a 
ca. 200kD protein (Intensity of a 200 XD band on an 
autoradlogran of a Western blot developed vlth an 
5 antiphosphotyroslne polyclonal antibody) In response to 
increasing anounts of GGF. 

Figp 36 is a list of splicing variants derived 
froB the sequences shovn in Pig. 30. 

Fig. 37 is the predicted amino acid sequence, 
10 bottom, and nucleic sequence, top, of ECFU (SEQ ZD Ho. 

150) . 

Fig. 38 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of BGFIi2 (SBQ ID No. 

151) . 

15 Fig. 39 is the predicted amino acid sequence, 

bottom, and nucleic sequence, top, of B6FL3 (SBQ ID No. 

152) . 

Fig. 40 is the j^edioted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFM (SBQ ZD No. 
20 153) . 

Pig. 41 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of B6FL5 (SBQ ZD No. 
154). 

Fig. 42 is the predicted amiiio acid sequence, 
25 bottom, and nucleic sequence, tpp, of BGFX»6 (SEQ ZD No. 
159). 

Fig. 43 is a scale coding segment map of the 
clone. T3 refers to the bacteriophage promoter used to 
produce tfRRa from the clone. R » flanking BcoRZ 
30 restriction enzyme sites. 5' UT refers to the 5' 

untranslated region. E, B, A, C, C/D' , and D refer to 
the coding segments. O - the translation start site. A 
» the 5' limit of the region homologous to the bovine E 
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s gaent (see Example 17) and 3' OT refers t th 3' 
untranslated region* 

7lg« 44 is the predicted aaino aeid sequence 
(middle) and nucleic sequence (top) of G6F2HBS5 (SEQ ID 
5 No. 21) • Hie bottom (intermittent) sequence represents 
peptide sequences derived from GGP-XI j^parations (see 
Pigs. 9, 9). 

Pig. 45 (A) is a graph shoving the purification of 
rCGP on cation lucchange column lay fraction; Pig. 45 (B) 
10 is a photograph of a Hestem blot using fractions as 
depicted in (A) and a G6FII specific antibody. 

Fig. 46 is the sequence of the G6FHBS5, 66FHFB1 
and G6FBPP5 polypeptides (SEQ ID NOS: 166, 167, and 168). 

Fig. 47 is a map of the plasmid pcDRRFpolyA. 

15 PatftHod PggarlptiQn 

The invention pertains to the use of isolated and 
purified neuregulin factors and DMA sequences encoding 
these factors, regulatory coapounds vhitih increase the 
esctramuscular concentrations of these factors, and 
20 coapounds which are mimetics of these factors for the 
induction of muscle cell mitogenesis, differentiation, 
and survival of the muscle cells in vivo and in vitro. 

It is evident that the gene encoding G6F/pl85*^^ 
binding neuregulin proteins produces a number of 
25 variably-sised, differentially-spliced RNA transcripts 
that give rise to a series of proteins. These proteins 
are of different lengths and contain some comaon peptide 
sequences and some unique peptide sequences • The 
conclusion that these factors are encoded by a single 
30 gene is supported by the differentially-spliced RNA 
. sequences irtiich are recoverable from bovine posterior 
pituitary and human breast cancer cells (llDA-ICB-231)) . 
. Further support for this conclusion derives from the size 
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rang of proteins which act as both aitogens for suscle 
tissue (as disclosed herein) and as ligands for the 
pl85**^^ receptor (see below)* 

Further evidence to support the fact that the 
5 genes encoding GGF/pias*'^ binding proteins are 

hoBologous cones from nucleotide sequence ooaparison. 
Holaes et al«, (Science 25fis 1205-1210, 1992) denonstrate 
the purification of a 45«lcilodalt6n huaan protein 
(Heregulin-tt) whidh specifically interacts with the 

10 receptor protein plSS**^. Poles et al« (Cell SSLz 205 
(1992)) and Wen et al. (Cell £i:559 (1992)) describe a 
cospleaentary axh isolated from rat cells encoding a 
protein called "neu differentiation factor" (NDF) • the 
translation product of the NDF cDKA has plSS*'^^ binding 

15 activity. Several other groups have reported the 

purification of proteins of various nblecular weights 
with pl85*>^^ binding activity. These groups include 
Lupu et al. ((1992) Froc. Rati. Acad. Sci. USA 89:2287); 
Yarden and Poles ((1991) Bipdhenistry 30:3543); Lopu et 

20 al. ((1990) science 249:1552)); Dobashi et al. ((1991) 
Biochen. Blophys. Res. Cons. 179:1536); and Huang et al. 
((1992) J. Biol. Chen. 257:11508-11512). 

We have found that pl85*>^^ receptor binding 
proteins stiaulate suscle cell mitogenesis and hence,. 

25 stisulates qrotube formation (nyogenesis) . This 

stisulation results in increased fomation of syoblasts 
and increased fonation of lyotubes (myogenesis) • The 
conpounds described herein also stinulate increased 
Buscle growth, differentiation, and survival of suscle 

30 cellSi These ligands include, but are not lisited to the 
GGF's, the neuregulins, the heregulins, NDF, and AHZA. 
AS a result of this sitogenic activity, these proteins, 
DNA encoding these proteins, and related compounds say be 
administered to patients suffering from traumatic damage 
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r dlsaaaas t th Buscle tissu • It is understood that 
all aathbds provided for the purpose of aitogenesls are 
useful for the purpose of ayogenesis. Inhibitors of 
these ligands (such as antibodies or peptide fragsents) 
5 nay be adainistered for the treatnent of auscle derived 
tuaors. 

Vhese coaqxiunds may be obtained using the 
protocols described herein (Bxaaples 9-17) and in Holnes 
et al., Soieaoe 25fi: 1205 (1992); Poles et al., Cmll 

10 fi2:205 (1992); Wen et al.. Cell £2:559 (1992); Lupu et 
al., Proc. Natl. Acad. sci. USA 89:2287 (1992); Yarden 
and Poles, Biocbeaistzy 30:3543 (1991); Lupu et al.. 
Science 249:1552 (1990); Dobashi et al., Bioeheta. 
Biopbya. Bee. Cbaa. 179:1536 (1991); Buang et al«, J. 

15 Biol. Cbea. 257:11508-11512 (1992); Nsrchionni et al., 

ffature 362:313, (1993); and in the GOF-ZZZ patent, all of 
which are incorporated herein by reference. The 
sequimces are provided and the characteristics described 
for aany of these coapounds. For sequences see Figs. 8- 

20 11, 20-27C, 29-34, 36-44, and 46. For protein 

characteristics see Figs. 12-19, 28 35, 45A and 45B. 

Coapounds aay be assayed for their usefulness In 
XifiCfi using the aethods provided in the eieaaples belov. 
In vivo testing aay be perf oraed as described in Exaaple 

25 1 and in Sklar et al.. In Vitro Cellular and 
Develppaental Biology 27A: 433-434, 1991. 

Pthflir Bahffdiinflntn 

The invention includes aethods for the use of any 
protein vhich is substantially hoaologous to the coding 
30 segaents in Fig. 30 (SEQ ID Nos. 132-143, 156, 1576-147, 
160, and 161) as veil as other naturally occurring GGF 
polypeptides for the purpose of inducing auscle 
aitogenesis. Also included are the use of: allelic 
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variations; natural autants; indue d nutants; pr tains 
•ncodad by DNA that hybridisaa undar high or lov 
stringency conditions to a nucleic acid naturally 
occurring (for definitions of high and lov stringency see 
5 current Protocols in Mtoleeular Biology, Jdhn Wiley « 
sons, New YorTa, X989, 6.3.1 - 6.3.6, hereby incorporated 
by reference) ; and the use of polypeptides or proteins 
specifically bound by antisera to G6F polypeptides. The 
ters also includes the use of chiaeric polypeptides that 

10 include the G6F polypeptides conprising sequences fron 
Pig. 28 for the induction of auscle nltogenesis. 

AS vill be seen froa Exaaple 8, belov, the present 
factors exhibit aitogenic activity on a range of cell 
types. The general stateaents of invention above in 

15 relation to foraulations and/or aedicaaents and their 
aanufacture should olsarly be construed to include 
appropriate products and uses. 

A series of experiaents f ollov vhich provide 
additional basis for the claias described herein. The 

20 following exaaples relating to the present invention 
should not be construed as specifically liaiting the 
invention, or such variations of the invention, nov known 
or later developed. 

The exaaples illustrate our discovery that 

2S recoabinant huaan 66F2 (rhG6F2) confers several effects 
on priaary huaan auscle culture. rhGGF2 has significant 
effects in three independent biological activity assays 
on auscle cultures. The polypeptide increased 
aitogenesis as aeasured by proliferation of subconf luent 

30 quiescent ayoblasts, inoreased differentiation by 

confluent myoblasts in the presence of growth factor, and 
increased survival of differentiated ayotubes as aeasured 
hf loss of dye exclusion and increased acetylcholine 
receptor synthesis. These activities indicate efficacy 
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Of GCP2 and other neuregulins in inducing auscle repair, 
regeneration, and prophylactic effects on auscle 
degeneration. 

BXMgliB I 

5 Mitoqenic Activity ef rhGGV Hvoblnflf^fl 

Clone G6F2HBS5 vas expressed in recombinant 
Baculovirus infected insect cells as described in 
Bxaaple 14, infra, and the resultant recoabinant huaan 
G6F2 vas added to myoblasts in culture (conditioned 

10 aediua added at 40 |ftl/al) • Myoblasts (057A cells) were 
grown to preconfluence in a 24 veil dish. Nediua vas 
reaoved and replaced vith DNEM containing 0.5% fetal calf 
serua vith or vithout 66F2 conditioned aediua at a 
concentration of 40 Ml/al. Mediua vas changed after 2 

15 days and cells vere fixed and stained after 5 days. 

Total nuclei vere counted as vere the nuaber of nuclei in 
ayoblasts (Table 1) . 

TABLB 1 





Troataent 


Total NuBber 

of NUClOl/OB^ 


Nuclei in 
Myotubes 


Fusion 
Indole 


20 


Control 


395 i 38.3 


204 ± 9.19 


0.515 i 0.01 




G6F 40Ml/al 


636 ± 8.5 


381 t 82.7 


0.591 1 0.15 



66F treated ayoblasts shoved an increased nuaber of total 
nuclei (636 nuclei) over untreated cmtrols (395 nuclei) 
indicating aitogenic activity. rhG6F2 treated ayotubes 
25 had a greater nuaber of nuclei (381 nuclei) than 

untreated controls (204 nuclei). Thus, rh66F2 enhances 
the total nuaber of nuclei through proliferation and 
increased cell survival. rhG6F2 is also likely to 
enhance the foraation of ayotubes. 
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The aitog nic activity of rhGGF2 may be measured 
In vivo by giving a continuous supply of GGF2 and 

thymidine to rat muscle via an osmotic mini pump. 
The muscle bulk is determined 1^ vat weight after one and 
5 two veeks of treatment. DNA replication is measured by 
counting labeled nuclei in sections after coating for 
autoradiography (Sklar et al*. In Vitro Cellular and 
Developmental Biology 27A: 433*434, 1991) in sham and 
rhG6F2-treated muscle • Denervated muscle is also 
10 examined in this rat animal model via these methods and 
this method allows the assessment of the role of rha6P2 
in muscle atrophy and repair. Mean fiber diameter can 
also be used for assessing effects of FGP on inrevention 
of atrophy. 



15 EXMgLB 2 

Btfgct 9t rhoara an unrein CtH MitggmMii 

Quiescent primary clonal human myoblasts vere 
prepared as previously described (Sklar, R., Budson, A., 
Brown, R. , In vitro C0llular and DevelopDeatal Biology 

20 1991; 27A:433*434) . The quiescent cells were treated 

with the indicated agents (rhG6F2 conditioned media, PD6F 
with and without methylprednisolone, and control media) 
in the presence of lOiiN BrdU, 0.5% FCS in IBIEK. After 
two days the cells were fixed in 4% paraformaldehyde in 

25 PBS for 30 minutes, and washed with 70% ethanol. The 
cells were then incubated with an anti*BrdU antibody, 
imshed, and antibody binding was visualized with a 
peroxidase reaction. She number of staining nuclei vere 
then quantified per area. The results show that GGF2 

30 Induces an increase in the number of labelled nuclei per 
curea over controls (see Table 2). 
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nsLB a 



Mltogmlo Effects of G6F on Huun Kyoblasts 



TroatMnt 


Labollad , 
maclol/cir 


T-Tast 
p valuo 


Control 


120 ± 22.4 




Znf octod Control 


103 ± 11.9 




G6P 5 Ml/al 


223 ± 33.8 


0.019 


PDGF 20 ng/Bl 


418 ± 45.8 


0.0005 


I6PZ 30 na/al 


280 ± 109.6 


0.068 


Notliylprodniaolono 1.0 fiM 


142 i 20.7 


0.293 



10 Platelet derived growth factor (PDGF) was used as a 

positive control, Kethylprednlsolone (a corticosteroid) 
vas also used in addition to rhGGP2 and showed no 
significant increase in labelling of UHh. 

rhGGF2 purified to honogeneity (>95% i»ure) is also 
15 aitogenic for hunan nyoblasts (Fig. 1}/ 

Recombinant hunan GGF2 also causes nitogenesis of 
priaary human myoblasts (see Table 2 and Fig, 1) • The 
nitogenesis assay is perfomed as described above; The 
. nitotic index is then calculated tv dividing the nuaber 
20 of BrdU positive cells by the total number of cells. 

EXMIFLE 3 

f ftet of rhCCP2 on Muaele Cull Pif fegentiation 

The effects of purified rhG6F2 (95% pure) on 
auscle culture differentiation were examined (Fig. 2) • 
25 confluent ayoblast cultures were induced to differentiate 
by levering the serua content of the culture aedlua froa 
20% to 0.5%. The test cultures were treated with the 
indicated concentration of rhG6F2 for six days, 
refreshing the culture aedium every 2 days. The cultures 
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ver then fixed, stained, and the nuaber of nucl i 
counted per ailliaeter. The data in Fig. 2 denonetrate a 
large increaee in the nuaber of nuclei in ayotubee when 
rh6GF2 is present, relative to controls. 

5 BXmPLB 4 

^^mm^ rttcsya on thm flurvival of Differentiated 
Myotiibes 

The survival of differentiated ayotubes was 
significantly increased by rhG6F2 treataent. Muscle 

10 cultures vers differentiated in the presence of rhG6F2 
and at various tiaes the nuaber of dead ^otubes vers 
counted }3Y prbpidiua iodide staining. As can be seen in 
Fig. 3, the nuaber of dead ayotubes is lover in the 
rhG6F2 treated culture at 4, 5, 6, and 8 days of 

15 differentiation. The nuaber of nuclei in ayotubes was 
significantly increased by G6F2 treataeat coapared to 
untreated cultures after 8 days of differentiation, 
specifically, the control tfhoved 8.6 aypauelei/aa^, vhile 
rhfi6F2 treated cultures shoved S7.2 ayonuclei/aa^ 

20 (p-0.035) vhen counted on the saae plates after geiasa 
staining. 

The survival assay was also perf oraed vith other 
grovth factors vhich have knovn effects on ausole 
culture. The rhOOF2 effect vas unique aaong the grovth 

25 factors tested (Fig. 4) . In this eacperiaent cultures 
vere treated in parallel vith the rhGGF2 treated plates 
vith the indicated concentrations of the various grovth 
factors. Survival of ayotubes vas aeasured as above at 8 
days of differentiation of 057A ayoblast cells. 

30 Cmcentrations of factors vere as follovs: rhGGF2: 

lOOng/al; huaan platelet derived grovth factor: 20ng/al; 
huaan basic fibroblast grovth factor: 25ng/al; huaan 
epideraal grovth factor: 30ng/al; huaan leucocyte 
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Inhibitory factor: lOng/ml; huaan insulin like growth 
factor Z: 30ng/al; huaan insulin like growth factor II: 
25ng/Bl. 

The observed protection of dif ferentiatied ayotubes 
5 from death indicates that rhGGP2 has proaise as a therapy 
for intervention of auscle degeneration characterized by 
nuaerous auscle diseases. Thus, agents which increase 
the esetraauscular concentration of neuregulins aay have a 
prqphylactie effect or slow the progress of auscle** 
10 wasting diswders and increase rates of auscle 

differentiation, repair* conditioning, and regeneration. 

BtftHPM g 

rhggr2 Promotes survival of niffiiri.fi<.i^eed nvotubi^a w4i.h 
e genetic Defect at the DttChaana MumculMr Dvatrophv I^oeufl 

iS The positive effects of rhG6F2 on ayotube survival 

could reflect potential efficacy in degenerative 
disorders • These effects on ayotube survival were tested 
on a clonally**derived priaary Duchenne ayoblast to 
deteraine if the response observed in noraal auscle 

20 culture could also be deaonstrated in cultures derived 
froa diseased individuals. The data presented in Fig. 5 
was obtained using the saae auscle culture conditions 
(Bxaa^le 4, above) used for norxaal individual. rh66F2 
significantly decreased the nuaber of dead ayotubes in 

25 the differentiated Duchenne auscle culture, coapared to 
controls (p»0.032). Concentrations were as follows: 
GGP2: loong/al; huaan platelet derived growth factor: 
20ng/al; huaan insulin like growth factor I: 30ng/al. 

This exaaple demonstrates that rhG6P2 can also 

30 proaote survival of differentiated Duchenne ayotubes and 
provides strong evidence that rhGGF2 aay slow or prevent 
the course of auscle degeneration and wasting in aaaaals. 
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gXftMPLB g 

rhCCF2 Effect on lAii Pif farantlation Proorang Induction 
of MHC fllov and Dvtrophln Proteins 

Sbe effects of purified rb66F2 on nusele culture 
5 differentiation vas also exaained by Western analysis of 
culture lysates* The levels of nusele specific proteins 
vera determined in triplicate treated and untreated 
cultures. These cultures vere prepared and treated as 
above except that the plate size %ras increased to 150 mm 

10 and the ouscle culture layer vas scraped off for Western 
analysis as described in Sfclar, and Brovn, R* (J. 
Hmurol. Sci. 101:73-81, 1991). The results presented in 
Table A indicate that rhG6P2 treatment increases the 
levels of several muscle specific proteins, including 

15 dystrophin, myosin heavy chain (HBC, adult slov and fast 
isoforms), but does not increase the levels of BSP72 or 
HBC neonate isof orm to a similar level per amount of 
protein loaded on the Western. The levels of muscle 
specific proteins induced by rhGGF2 irere similar to the 

20 quantitative increases in the number of myonuclei/mm^ 
(Table 3). 
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Control ±SD rbGGF2 Troat- p 

value 

Bent ±SD 



Total Protein (ug) 


554 ± 38.4 


798 ± 73.6 


0.007 


Kyenuelol/n^ 


29.0 t 12.2 


106 ± 24.1 


0.008 


NEC fast/M9 protoln 


1.22 ± 0.47 


4.00 ± 0.40 


0.001 


MHC slow/fig protoln 


0.17 ± 0.13 


1.66 1 0.27 


0.001 


MHC neonate/itg protein 


0.30 t 0.27 


0.55 ± 0.04 


0.199 


dystrophin/Mg protein 


6.67 1 0.37 


25.5 ± 11.0 


0.042 


BSP 72/M9 protein 


3.30 t 0.42 


4.54 ± 0.08 


0.008 



The rh6GP2 dependent Increaee in the adult ayoeln 
heavy chain ieofons (alov is found in type Z human 
muscle fibers; fast is found in type 2A and 2B human 

15 muscle fibers) may represent a maturatimi of the 

myotubesi as the neonatal isoform was not significantly 
increased by rhG0F2 treatment. During rat muscle 
development MHC isof orms switch from fetal to neonatal 
forms followed by a switch to mature adult slow and fast 

20 MHC isof orms (Feriasamy at al. J. fiiol. Cham. 259:13573* 
13578, 1984; Feriasamy at al. J. Biol. Cbem. 260:15856- 
15862, 1985; Wiecsorek et al. J. Cell Biol. 101:618-629, 
1985) • While muscle can autonomously undergo some of 
these isoform transitions in the absence of neural cells 

25 or tissue, mouse muscle explants express the adult fast 
MHC isoform only when cultured in the presence of mouse 
spinal cord (Bcob-Frinee et al. J. cell Biol. 103:995- 
1005, 1986) . Additional evidence that MHC isoform 
transitions are influenced by nerve was established by 

30 Ittialen et al. (Oeve. Biol. 141:24-40, 1990); after 

regeneration of notexin treated rat soleus muscles only 
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th adult fast NHC Isofonn was produead In th new 
denervated suacle, but Innarvated regenerated nuscle made 
both fast and slow adult MHC Isofons. Thus the 
deaonstratlon in Table 3 that rhG6P2 inoreeses the 
5 synthesie of adult MBC isof orss indicates that rh66P2 aay 
induce a developmental aaturation of Buscle which nay 
mimic neuronal innervation* 

BXMgLE 7 

Maugemilins, infiludinn ghCeW- indiifie tHe svntHesia of 

10 aMtYlctoQltM wgflPtMTg in w»glat 

The expression of acetylcholine receptor (AehR) 
subunit proteins can be induced by exposing muscle cells 
to neuregulins. norm specifically, ve have sbo%m that 
contacting muscle cells with rhG6P2 can induce the 

15 synthesis of AchB subunit proteins. This induction 
following rhG6P2 exposure vas observed in two wayss 
first, we detected increased expression of human growth 
hormone via the product of a reporter gene construct and 
second we detected increased binding of al|Aia« 

20 bungarotoxin to cells. 

In the following example a mouse myoblast cell 
line G2 was used. C2 cells were transfected with a 
transgene that contained the 5' regulatory sequences of 
the AChR delta subunit gene of mouse linXed to a human 

25 growth hormone full-length oDNA (Baldwin and Burden, 
1988. J. Cell Biol. 107:2271-2279). This reporter 
construct allmrs the measurement of the induction of AChR 
delta gene expression by assaying the quantity of growth 
hormone secreted into the media. The line can be induced 

30 to form myotubas by lowering serum concentration in the 
media from 20% to 0.5%. 

Specifically, mouse C2 myoblasts transfected with 
an AChR-human growth hormone reporter construct and were 
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assayed for expr ssl n f hGH foil wing trsataent with 
rliG6F2. Thm results of tvo separate experinents are 
suBBarised in Table 4 and in Figures 6 (hGH exptession) 
and 7 (hCH expression and alpha-^bungarotoxin binding) • 
5 Shown are the dose response curves for secreted huaan 
growth homone and for bungarotoxin binding f roa muscle 
cultures treated with rhGGFS. 



TBMLM 4 

Effects of rh66F2 on the expression of AGhR delta 
10 subunit/hCT transgene and the synthesis of AChR 

BtBU tXSLl 

G6F hOB hGH AChR 

15 (ul) (ng/Bl) (ng/ml) (qpa/ng protein) 



0 9.3 -f 2.1 5.7 ^ 2.1 822 4- 170 

0.1 - 8.8 1.5 891 134 

0.5 • 12.0 0.9 993 -I- 35 

1.0 - 9.7 + 2.3 818 + 67 

20 5.0 17.5 *f 2.8 14.7 *f 3.5 1300 -I- 177 

10.0 14.3 3.2 14.1 4 3.3 1388 ^ 137 

15.0 22.0 +1.4 



25 



02 syotubes were treated with cold a-BTX (20 nN) 
for 1 hr. at 37 washed with culture sediuB twice and 
then treated with G6F2. Culture aediUB was adjusted with 
30 bovine serun albumin at the concentration of 1 ng/ml. 24 
hours later, culture medium was removed and saved for. hGH 
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assay. Hascls culturas war treated with ^^'l-o-BIX (20 
iiM) for 1 hoar at 37»C, washed and scraped In PBS 
containing i% SDS. Hen-speelfie binding was deterBined 
in the presence of cold o-BZX (40 nK) . ihe cell 
5 hcBogenate was counted for radioactivity and assayed for 
total, protein aaount. 

The presence of rhG6P2 led to a greater than 2- 
fold increase in hoa gene es^resslon, thereby indicating 
that tboan induced the synthesis of the delta subunit of 
10 the acetylcholine reeq^tor. Purtheraore, increased 

bttngarotexin binding is consistent with asseably of these 
subunit proteins into functional acetylcholine receptors. 
To strenthen the interpretation of these data the 
analysis was repeated on cultures that had the h6H 
15 reporter linked to a netallothiene prorotor, uhlch should 
not be responsive to rh66P2. The results of that control 
experlMnt showed that the hGB response was aadlated 
through transcriptional activation of the AohR delta 
subunit gene control elenents. 
20 These results indicate that rhG6P2 could be useful 

in replMiishing AdiRs as part of the therapy for the 
autoioBune disease Myasthenia gravis, this activity aay 
also be beneficial in treataent of peripheral nerve 
regeneration and neur^thy by stimilating a key step in 
25 re-lnnervatlon of anascle. 

Additional Miteaenie Aetlvltlna «f Purified ecF.T 
SSE-ZZ 

The ■itogenie activity of a highly purified saaple 
30 containing both GGPs I and II was studied using a 

quantitative method, which allows a single nleroeulture 
to be exanined for DMA synthesis, cell aorphology, cell 
nuaber and expression of cell antigens. This technique 
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has been aodif led froa a method previously reported by 
Hair et al., analytical Bloche&istry 185, 377-382, 1990. 
The main aodlflcations are: 1) the use of uncoated 
aicrotiter plates, 2) the cell nuaber per veil, 3) the 
5 use of 5% Foetal Bovine Plasaa (FBP) instead of 10% 

Foetal Calf Serum (FCS), and 4) the time of incubation in 
presence of mitogens and broaodeoxyuridine (BrdU) , added 
siaultaneously to the cultures, in addition the cell 
aonolayer vas not washed before fixation to avoid loss of 

10 cells, and the incubation time of aonoclonal mouse 
anti-BrdU antibody and peroxidase conjugated goat 
anti-aouse iaaunoglobulin (XgG) antibody vers doubled to 
increase the sensitivity of the assay. The aissay, 
optimised for rat sciatic nerve SAvann cells, has also 

15 been used for several cell lines, after appropriate 
aodif ications to the cell culture conditions. 

1j Methods gf Mitwrneais ToBtiiw 

On day i, purified Schwann cells were plated onto 
uncoated 96 well plates in 5t FBP/Dulbecco's Modified 

20 Bagle Mediua (DMEM) (5,000 cells/well) . On day 2, GGFs 
or other test factors were added to the cultures, as well 
as BrdU at a final concentration of 10 /la. After 48 
hours (day 4) BrdU incorporation was terainated by 
aspirating the mediua and cells were fixed with 200 

25 Ml/well of 70% ethanol for 20 ain at rooa tepperature. 
Next, the cells were washed with water and the ONA 
denatured by incubation with 100 pi 2N HCl for 10 ain at 
37*0. Following aspiration, residual acid was 
neutralised by filling the wells with 0.1 N borate 

30 buffer, pR 9.0, and the cells were washed with phosphate 
buffered saline (PBS) • Cells were then treated with 50 
Ml of blocking buffer (PBS containing 0.1% Triton X 100 
' and 2% noraal goat serua) for 15 ain at 37 *c. After 
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aspirati n, and nal a usa antl-BrdU antibody (Dako 
Corp., Santa Barbara, CX) (50 Ml/vell, 1.4 M9/al diluted 
In blocking buf far) was added and Incubated for tvo hoxire 
at 37 •C. unbound antibodies vere reaoved by three vaehee 
5 la PBS containing 0.1% Vrlton X-100 and 

peroxldase-conjugated goat antl^aouae IgG antibody (Oako 
Corp., Santa Barbara, CA) (50 iil/vell, 2 ^g/al diluted In 
blocking buffer) tree added and Incubated for one bour at 
37*C. After tbree vaebes In PBS/Triton and a final rinse 

10 In PBS, veils received 100 m1/vs11 of so tfK 

phosphate/citrate buffer, pH 5.0, containing 0.05% of the 
soluble chroBogen o-phenylenedlaalne (OPD) and 0.02% R^Os* 
The reaction vas teralnated after 5*20 aln at rooa 
teaperature, by pipetting 80 iil fron each veil to a clean 

15 plate containing 40 Ml/vell of 29 sulfuric add. Xhe 
abs orb ance vas recorded at 490nB using a plate reader 
(Dynatedi Labs) . me assay plates containing the cell 
miolayers vere vashed tvlce vlth PBS and 
iaaunooytochealcally stained for BrdU-^DHA by adding 100 

20 Ml/vell of the substrate dlaalnobenzldlne (DAB) and 0.02% 
H2O2 to generate an Insoluble product. After 10-20 aln 
the staining reaction vas stopped by vashlag vlth vater, 
and BrdU-posltlve nuclei observed and counted using an 
Inverted alcroscope. occasionally, negative nuclei vere 

25 counterstalned vlth 0.001% Toluldlne blue and counted as 
before. 

XIa — Cell llMB Wtd gflg HitWSMBiB ASSaYS 

SifisB 3T3 rlbroblmatat cells, frea Flov Labs, 
vere aalntalned In SMBN suppleaented vlth 10% PCS, 
30 penicillin and streptoaycln, at 37 *C In a hualdlfled 
ataosphere of 10% COj In air. Cells vere fed or 
subcultured every tvo days. For altogenlc assay, cells 
vere plated at a density of 5,000 cells/veil In coaqplete 
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nediua and inoabated for a weak until cells vera 
confluant and quiescent. The serum containing aediua vas 
renoved and the cell aonolayer vashed tvice vith serua 
free-aediuB. 100 Ml of serua free aediua containing 
5 aitogens and lOpN of BrdU vere added to each veil and 
incubated for 48 hours. Dose responses to GGFs and serua 
or PD6F (as a positive control) vere perforaed. 

BSIC (Baby Hamster XiOamy) 21 C13 Fibroblaatai 
Cells froa European Collection of Aniaal cell Cultures 

10 (BCACC) , vere aaintained in Glasgov Modified Eagle Mediua 
(GMEM) suppleaented vith 5% tryptose phosphate broth, 5% 
PCS, penicillin and streptoaycin, at 37 •c in a huaidified 
ataos^ere of 5t co^ in air. Cells vere fed or 
subcultured every tvo to three days. Per aitogenic 

15 assay, cells vere plated at a density of 2,000 cell/veil 
in coa^lete aediua for 24 hours. The serua containing 
aediua vas then reaoved and after vashing vith serua free 
aediua, replaced vith 100 Ml of 0.1% PCS containing GKBM 
or OIBM alone. GGFs and PCS or bPGP as positive controls 

20 vere edded, coincident vith IOmM BrdU, and incubated for 
48 hours. Cell cultures vere then processed as described 
for schvann cells. 

C5 Aat Glioaa Ceil lilae: Cells, obtained at 
passage 39, vere aaintained in MCEM containing 5% PCS, 5% 

25 Horse serua (H8), penicillin and streptoaycin, at 37 ^C in 
a huaidified ataosphere of 10% CO^ in air. Cells vere 
fed or subcultured every three days. For aitogenic 
assay, cells vere plated at a density of 2,000 cells/ veil 
in coBqplete aediua and incubated for 24 hours. Then 

30 aediua vas replaced vith a aixture of 1:1 MEK and F12 
aediua containing 0.1% PCS, after vashing in serua free 
aediua. Dose responses to GGFs, PCS and aPGF vere then 
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pcrfenMd and eallB ware proc ssad through the BLZSA as 
praviously daseribad for the othar call typas. 

PC12 (R»t AOranal PhaochzoBoeytoaa Cells) t Calls 
froB WCMCe, vara aaintainad in BPMI 1640 sopplaMntad 
s vlth 10% BS, S% Tea, panlelllln and strapteayein« lii 
collagan eoatad flasks, at 37 in a bUBidifiad 
ataosphara of 5% CO^ in air. Calla vara fad avary thraa 
days by replacing 80% of the aadlua. For aitoganic 
assay, calls ware plated at a density of 3,000 calls/well 

10 in collate Mdiua, on collagen coated plataa (50 nl/wll 
collagan, vitrogan Collagen Corp., diluted 1 t SO, 30 ain 
at 37*e) and incubated for 34 hours. She aediUB was then 
placed with fresh RSHI either alone or containing 1 BM 
insulin or 1% PCS. Dose responses to PC8/B8 (1:2) as 

15 positive control and to C6Ps ware performed as before. 
After 48 hours calls ware fixed and the BLZSA perf oned 
aa previeualy describ ed . 

TTT. Ramiito i at ni±aamnm*i» xaaavsi All the experiments 

presented in this Bxanple vere perfonMd using a highly 
20 purified sample fron a Sepharose 12 ebroaatography 

purif Icatim step containing a mixture of G6P-X and 

G6P-ZI (G6FS). 

First, the results obtained vith the BrdU 

incorporation assay vere ceapared vith the classical 
25 aitegenio aaaay for Sehvann cells based on [125]Z-XldR 

incorporation into DMA of dividing calU, described by J. 

P. Broekes (Methods Ensymol, 147:217, 1987). 

Fig. 13 shows the comparison of data obtained vith 

the two assays, perforaed in the aana call culture 
30 conditions (5,000 cells/veil, in 5% FBP/DMBI, incubated 

in presence of CCPs for 48 hrs). As clearly ahown, the 

results are coq>arable, but BrdU incorporation assay 
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appears to be slightly aore sensitive, as suggested by 
tbe shift of the carve to the left of the graph, i.e. to 
lover concentrations of GGFS. 

As described under the section "Methods of 
5 Mitogenesis Testing", after the iaanunoreactive BrdU-DlU^ 
has been quantitated by reading the intensity of the 
soluble product of the OPD peroxidase reaction, the 
original assay plates containing cell aonolayers can 
undergo the second reaction resulting in the insoluble 

10 DM product, «fhich stains the BrdU positive nuclei. The 
nicrocultures can then be examined under an inverted 
microscope, and cell morphology and the nuobcurs of 
BrdU^positive and negative nuclei can be observed. 
In Pig. ISA and Pig. 13B the BrdD-DMA 

IS iamunoreactivity, evaluated by reading absorbance at 490 
nm, is coapared to the number of BrdU-positive nuclei and 
to the percentage of Brda*positive nuclei on the total 
number of cells per veil, counted in the same cultures. 
Standard deviations vere less than 10% * The tvo 

.20 evaluation methods shov a very good correlation and the 
discrepancy betveen the valuee at the hi^^st dose of 
GGPs can be esqplained by the different extent of DMA 
synthesis in cells detected as BrdU«»positive. 

The BrdU incorporation assay can therefore provide 

25 additional useful information about the biological 

activity of polypeptides on Schvann cells tAen compared 
to the (125) X-MR incorporation assay. Por example, the 
data reported in Pig. 15 shov that GOPs can act on 
schvann cells to Induce DMA synthesis, but at lover doses 

30 to increase ths number of negative cells present in the 
microculture after 48 hours. 

The assay has then , been used on several cell lines 
of different origin. Zn Pig. 15 the mltogenic responses 
of Schvann cells and Sviss 3T3 fibroblasts to GGPs are 
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conpared; despita the veaX response btain d in 3T3 
fibroblasts, sone clearly BrdU*positive nuclei vere 
detected in these cultures. Control culttires vere run in 
parallel in presence of several doses of FCS or huaan 
5 reeooibinant FD6F, shoving that the cells could respond to 
an^qpriate stiauli (not shorn). 

. The ability of fibroblasts to respond to GGPs vas 
further investigated using the BHK 21 C13 cell line. 
These fibroblasts, derived from kidney, do not exhibit 

10 contact inhibition or reach a quiescent state vhen 

confluent. Therefore the experiaental conditions vere 
designed to have a very lov background proliferation 
vithout conproBising the cell viability. CGFs have a 
significant nltogenic activity on BHK21 C13 cells as 

15 shewn by Fig. 16 and Fig. 17. Fig. 16 shovs the Brdu 
incorporation into DMA by BHK 31 C13 cells stiaulated by 
G6FS in the j^esence of 0.1% FCS. The good Bitogenic 
re^onse to FCS indicates that cell culture conditions 
vere not limiting. In Fig. 17 the aitogenic effect of 

20 . GGFs is expressed as the number of BrdU-positive and 
BrdU-*negative cells and as the total nunber of cells 
counted per veil. Data are representative of tvo 
experisents run in diqplicates; at least three fields per 
veil vere counted. As observed for schirann cells in 

25 addition to a proliferative effect at iov doses, GGFs 
also increase the nuskbers of nonresponding cells 
surviving. The percentage of BrdU positive cells is 
proportional to the increasing anounts of GGFs added to 
the cultures. The total nunber of cells after 48 hours 

30 in presence of higher doses of GGFs is at least doubled, 
confining that GGFs induce DHA synthesis and 
proliferation in BHK21 C13 cells. Under the saae 
conditions, cells saintained for 48 hours in the presence 
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of 2t FCS Shoved an increase of about six fold (not 
shown) • 

C6 glioma cells have provided a useful model to 
study glial cell properties, the phenotype exinressed 
5 seems to be dependent on the cell passage^ the cells more 
closely resembling an astrocyte phenotype at an early 
stage, and an oligodendrocyte phenotype at later stages 
(beyond passage 70). C6 cells used in these experiments 
vers from passage 39 to passage S2. C6 cells are a 

10 highly proliferating populationi therefore the 

experimental conditions vere optimized to have a very lov 
background of BrdU incorporation. The presence of 0.1% 
serum %ras necessary to maintain cell viability vithout 
significantly affecting the mitogenic responses, as shown 

15 by the dose response to FCS (Pig. 18) . 

In Fig. 19 the mitogenic responses to aFQF (acidic 
Fibroblast growth factor) and GGFs are expressed as the 
percentages oif maxixuil BrdU incorporation obtained in the 
presence of FCS (8%). Values are averages of two 

20 experiments, run in duplicates. The effect of GGFs was 
coBfparable to that of a pure preparation of aFGF* aFGF 
has been described as a specific growth factor for 06 
cells (liim R. et al.. Cell Regulation 1:741-746, 1990) 
and for that reason it was used as a positive control. 

25 The direct counting of BrdU positive and negative cells 
was not possible because of the high cell density in the 
microcultures. Zn contrast to the cell lines so far 
reported, PC12 cells did not show any evident 
responsiveness to GGFS, when treated under culture 

30 conditions in which PC12 could respond to sera (mixture 
of FCS and HS as used routinely for cell maintenance) • 
Nevertheless the number of cells plated per well seems to 
affect the behavior of PC12 cells, and therefore further 
es^eriments are reguired. 
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Amino acid sequences of purif ind gcf^t and ggmt 
Aaino add sequencs analysis studies were 
perf oned using highly purified hovine pituitary G6P*Z 
5 and GOP^'ZZ. The conventional single letter code vas used 
to describe the sequences. Peptides were obtained by 
lysyl endopeptidase and protease vs digests, carried out 
on reduced and carboxyaethylated sasples, with the lysyl 
endopeptidase digest of G6P-ZZ carried out on material 

10 eluted from the 55^65 RD region of a 11% SD8-PA6B (Ml 
relative to the above-^oted markers) . 

A total of 21 peptide sequences (see Fig. 8, 8BQ 
ID Nos. 1--20, 165) were obtained for GGF-Z, of vhich 12 
peptides (see Fig. 9, SBQ ID Nos. 1, 22-29, 17, 19, and 

15 22) are not present in current protein databases and 
therefore represent unique sequences. A total of 12 
peptide sequences (see Fig. 10, SBQ ZD Nos. 42-50 and 
161-163) were obtained for 06F-ZZ, of vhidh 10 prides 
(see Fig. 11, SBQ ZD Nos. 42-50) are not present in 

20 current protein databases and therefore represent unique 
sequences (an exception is peptide GGF-ZZ 06 vhicb shovs 
identical sequences in many proteins vhich are probably 
of no significance given the small number of residues) . 
These novel sequences are extremely likely to correspond 

25 to portions of the true amino acid sequences of GGFs z 
and ZZ. 

Particular attention can be drawn to the sequences 
of GOF-Z 07 and GGF-ZZ 12, vhich are clearly highly 
related, nie similarities indicate that the sequences of 
30 these peptides are almost certainly those of the assigned 
GGF species, and are most unlikely to be derived from 
contaminant proteins. 

Zn addition, in peptide GGF-IZ 02, the sequence X 
* S S is consistent with the presence of an N linked 
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carbohydrate aoiaty on an asparaglne at the position 
denoted hy X« 

In general, in Figs. 8 and 10, X represents an 
unknown residue denoting a sequencing cycle where a 
5 single position could not be called with certainty either 
because there was acre than one signal of equal sise in 
the cycle or because no signal was present. As asterisk 
denotes those peptides where the last amino acid called 
corresponds to the last amino acid present in that 

10 peptide. In the remaining peptides, the eignal strength 
after the last amino acid called was insufficient to 
continue sequence calling to the end of that peptide. The 
rl^t hand column indicates the results of a computer 
database search using the GOG package PASTA and TFASTA 

IS programs to analyse the NBRF and BHBL sequence databases. 
The name of a protein in this column denotes identity of 
a portion of its sequence with the peptide amino acid 
sequence called allowing a maximum of two mismatches. A 
question mark denotes three mismatches allowed. The 

20 abbreviations used are as follows; 

Hlf6-*l Hi^ Mobility Group protein-1 

1016-2 Hi^ lability Group protein-2 

LH-alpha Lutelnlsing hormone alpha subunit 
lil-beta Luteinising hormone beta subunit 

25 mBELSJLft 

Tfloiafeina and Clenimi of Wacleotide Sequences encoding 
tMrafcuina eontainlM CCF-I and CGF^II peptides 

Isolation and cloning of the G6F*-ZZ nucleotide 
sequences was performed as outlined herein, using peptide 
30 sequence infomtion and library . screening, and was 

performed as set out below. It will be appreciated that 
the peptides of Pigs. 10 and II can be used as the 
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Starting polixt for Is lati n and cloning f G6P-Z 
sequencM by following the techniques described herein. 
Indeed, Fig. 20, SEQ ID Nos. 51-84) shows possible 
degenerate oligonucleotide probes for this purpose, and 
5 Fig. 22, 8BQ ZD Hos. 86-115, lists possible PGR priaers. 
DHA sequence and polypeptide sequence should be 
obtainable by this seans as with G6F-II, and also OKA 
constructs and esqpression vectors incorporating such DMA 
sequence, host cells genetically altered by incorporating 
10 such constructs/ vectors, and protein obtainable l»y 
cultivating such host cells, me invention envisages 
sudk subject aatter. 

1m — Pegjqn and synthesis of olioenueleetide Probes sml 
£CiBBEft 

iS Degenerate DMA oligoner probes were designed by 

baektranslating the aaino acid sequences (derived from 
the peptides generated free purified OOF protein) into 
nucleotide sequences. Oligosers represented either the 
coding strand or the non-coding strand of the DHA 

20 sequence. When serine, arginine or leucine were included 
in the oligoner design, then tvo separate syntheses were 
prepared to avoid aabiguities. For exaaple, serine was 
encoded either TCM or AfiY as in 537 and 538 or 609 and 
610. Siailar codon splitting was done for arginine or 

25 leucine (e.g. 544, 545). DHA oligo»ers were synthesised 
on a Biosearch 8750 4«-column ONA synthesiser using 0^ 
cyanoethyl cheaistry operated at 0.2 sicrosole scale 
synthesis. Oligomers were cleaved off the coluan (500 
angstron CpG resins) and deprotected in concentrated 

30 anmoniuB hydroxide for 6-24 hours at 55-60*c« 

Deprotected oligoaers vere dried under vacuua (Speedvac) 
and purified by electrophoresis in gels of 15% acrylaaide 
(20 mono : l bis) , 50 nM Tris-borate-EDTA buffer 
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e ntaining 7M ur a. Full length oligoMrs v r d t«et«d 
in the gels by UV shadovlng, then the band* vere excised 
and DNA ollgoners eluted into 1.5 mis H20 for 4-16 hours 
with shaking. The eluate was dried, redissolved in O.l 
5 Bl and absorbance seasureBents vere taken at 360nB. 

Concentrations vere deterained according to the 
folloving focvula: 

(A 260 X UBits/al) (60.6/length - x /iM) 

All eligeaers vere adjusted to SO |iM concentration 
10 by addition ot BjO. 

Degenerate probes designed as above are shown in 
Fig. 20, SBQ ID NOS. 54-88. 

PGR primrs vere prepared by essentially the sane 
procedures that vere used tor pr^es vith the folloving 

IS aedifieations. tinkers of thirteen nucleotides 

containing restriction sites vere included at the 5' ends 
of the degenerate oligcmers for use in cloning into 
vectors. OKA synthesis vas perfemed at l aicrraole 
scale using 1,000 angstros resins and inesine vas 

20 used at positions irtiere all four nucleotides vere 
incorporated nen»lly into degenerate probes. 
Purifications of PCR priaers included an ethanol 
precipitation folloving the gel electrophoresis 
purification. 

25 TT. T.iiMr»yv CfiMtoietlan mnA Seraenine 

A bovine genoaie OKA library vas purchased f ron 
Stratagene (Catalogue Huaber: 945701). Tbm library 
contained 2 x 10* l5-20kb 8au3Al partial bovine DHA 
fragvents cloned into the vector laabda Dashll. A bovine 
30 total brain cDHA library vas purdiased from Clonetech 
(Catalogue Huaber: BL 10139) . coopleaentary ONA 
libraries vere constructed (In Vitrogen; Stratagene) froa 
' aKNA prepared tram bovine total brain, from bovine 
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pituitary axid f roa bovine posterior pituitary. In 
vitrogen prepared two cONA libraries: one library was in 
the vector laabda glO, the other in vector pcDHXI (a 
plasaid library) « The Stratagene libraries ware prepared 
5 in the vector lanbda unisap. Collectively, the cDA 

libraries contained 14 million priaary recoabinant phage. 

The bovine genoaio library vas plated on 2* fifili 
X12 host strain LB392 on 23 x 23 ea plates (Huno) at 
150,000 to 200,000 phage plaques per plate. Each plate 

10 represented approxiaately one bovine genome equivalent. 
Following an overnight incubation at 37«C, the plates 
were thilled and replicate filters were prepared 
according to procedures of Maniatis et al. (2t60«-81). 
Foiir plaque lifts were prepared from each plate onto 

15 uncharged nylon aeabranes (Pall Biodyne A or NSI 

Nitrppure) • The DKA vas imaobilized onto the aeabranes 
by cross-linking under OV light for 5 aimites or, by 
baking at 80 'C under vacuua for two hours. DKA probes 
vere latelled using T4 polynucleotide kinase (Mev England 

20 Biolabs) vith gaaaa 32P ATP (Nev England Kuclear; 6500 
Ci/aaol) according to the specifications of the 
suppliers. Briefly, 50 paols of degenerate DMA oligoaer 
were incubated in the presence of 600 ikCL gamma ^'p-ATP 
and 5 units T4 polynucleotide kinase for 30 minutes at 

25 37*C. Reactions were terainated, gel electrpj^horesis 
loading buffer was added and then radiolabelled probes 
were purified by electrophoresis. 32P labelled probes 
were excised froa gel slices and eluted into water. 
Alternatively, DNA probes were labelled via PGR 

30 amplification by incorporation of a-32P-dATP or a*32P 
dCTP according to the protocol of sehowalter and Soaaer, 
Anal. Biodhea 177:90-94 (1989). Probes labelled in PGR 
reactions were purified by desalting on Sephadex G-150 
coluans. 
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Prohyteidisation and hybridization vera parforaad 
in GMC buffer (0.52 M MaPi, 7t SOS, 1% BSJk, 1.5 bM BDTA, 
0.1 N NaCl 10 mg/ml tSNA). Washing was perforBed in 
oligovash (160 al 1 M Na2HP04, 200 b1 20% SDS, 8.0 ml 0.5 
5 M BDTA, 100 ml 5N MaCl, 3632 ml a20) • Typically, 20 

filters (400 mq. centimeters each) representing replicate 
copies of ten bovine genome equivalents were incubated in 
200 ml hybridisation solution vith 100 pmols of 
degenerate oligonucleotide probe (128-512 fold 

10 degenerate) . Hybridisation was allowed to occur 

overnight at 5^C belov the minimum melting tei«»erature 
calculated for the degenerate probe. The calculation of 
minimum melting temperature assumes 2*0 for an AT pair 
and 4*0 for a 60 pair. 

15 Filters vers vashed in ^^eated tihanges of 

oligowaah at the hybridisation teaqperatures four to five 
hours and finally, in 3.2N tetramethylammonium chloride, 
1% SDS tvice for 30 min at a temperature dependent on the 
DMA probe length. For 20mers, the final wash teqperatiure 

20 vas 60*0. Filters were mounted ^ then esq^sed to X-ray 
film (Kodak xaR5) using intensifying screens (Dupont 
Oronex Lightening Plus). Usually, a three to five day 
film eacposure at minus 80*0 vas sufficient to detect 
duplicate signals in these library screens. Folloving 

25 analysis of the results, filters could be stripped and 
reprobed. Filters vere stripped by incubating through 
two successive cycles of fifteen minutes in a microvave 
oven at full pover in a solution of 1% SDS containing 
lOtfK EDTA pHS. Filters vere taken through at least three 

30 to four cycles of stripping and reprobing vith various 
probes. 
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TTT. «itef«ihlf»«nfc Phaaa i«olii<; Ian, growth and PHA 

ThttM proeedur«a followed standard protocol as 
describod in p««samt»tMnfc pHx (Maniatis at al 2$60-3:SX). 

5 TV. linalv ia at TmtAmM eionaa Halno PRA PlgMtlon and 

Baooabinant Phage DHX saaples (2 aicrograas) vera 
digested according to conditions reconaendad by the 
restriction endonuclease supplier (Nev England Blolabs). 

10 Pellowing a four hoar incobation at 37*Cr tha reactions 
products vara praeipitated in the presence of O.IM sodium 
acetate and three voluaes of ethanol. Precipitated DHA 
vas collected by centrifugation, rinsed in 75% ethanol 
and dried. All resuspended saaples vere loaded onto 

15 agarose gels (typically 1% in TAB buffer; 0.04X Tria 
acetate, 0.002M BDIA) . Gel mas ware at l volt per 
centiaeter frea 4 to 20 hours. Markers included laabda 
Bind ZZZ DHA fragaanta and/or exi74BaeZZI DMA fragaanta 
(Mew England Biolabs). The gels were stained with 0.5 

20 aicrograas/al of ethidiua bromide and photographed. For 
southern blotting, DMA waa first depurinated in the gel 
by treataeat with 0.125 N BCl, denatured in 0.5 M MaOB 
and transferred in 20x BSC (3M sodiua chloride, 0.03 K 
Bodlus citrate) to uncharged nylon Beabranes. Blotting 

25 was done for 6 hours up to 24 hours, then the filters 
were neutralised in 0.5 Trie BCl pH 7.5, 0.15 M sodiua 
chloride, then rinsed briefly in 50 bM Tris-borate EDTA. 

For craaa-linking, the filters were wrapped f irat 
in transparent plaatic wrap, then the DMA side exposed 

30 for five minutes to an ultraviolet light. Bybridization 
and vaahing waa performed as described for library 
screening (see section 2 of this Example). For 
hybridization analysis to determine whether similar ganea 
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exist In other species slight sodifications were made. 
The DHA filter was purchased tros Clonetech (Catalogue 
Huaber 7753«>1) and contains S sicrograss of EcoRX 
digested DKA fros various species per lane* The probe 
5 was labelled by PCR aaq^lif Ication reactions as described 
in section 2 above, and hybridisations were done in 80% 
buffer B(2 g polyvinylpyrrolidine, 2 g Ficoll«*400, 2 g 
bovine serua albumin, 50 ml IM Tris^BCl (pR 7.5) 58 g 
NaCl, 1 g sodium pyrophosphate, 10 g sodium dodeqyl 

10 sulfate, 950ml 1^0) containing 10% dextran sulfate. Ibe 
probes vere denatured by boiling for ten minutes then 
rapidly cooling in ice water. The probe was added to the 
hybridization buffcur at 10^ dpm ^^P per ml and incubated 
overnight at 60«C. The filters were washed at 60*C first 

15 in buffer B followed by 2X SSC, 0.1% SD8 then in ix SSC, 
0*1% 8D8. For hi^ stringency, eaqperiments, final washes 
were done in 0.1 x SSC, 1% SOS and the temperature raised 
to 65*C. 

Southern blot data %rere used to prepare a 
20 restriction map of the genomic clone and to indicate 
which subf ragments hybridised to the GGP probes 
(candidates for subcloning). 

v. fiiibelonino of Saoments of DHA Homologous to 
Hybridigation Probaa 

25 DKA digests (e.g. 5 micrograms) were loaded onto 

1% agarose gels then appropriate fragments excised from 
the gels following staining. The DHA was purified by 
adsorption onto glass beads followed by elution using the 
protocol described by the supplier (Bio 101) • Recovered 

30 DHA fragments (100-200 ng) were ligated into linearised 
dephosphorylated vectors, e.g. pT3T7 (Ambion), which is a 
derivative of pDC18, using T4 ligase (Hew England 
Biolabs) . This vector carries the B. sail ^ lactamase 
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g ne, hmnc , transf r&ants can be selected on plates 
containing aaplcillin. The vector also supplies 0^ 
galactosidase cosiplesentation to the host cellt therefore 
non*reeoBbinants (blue) can be detected using 
5 isopropylthiogalaetoside and Bluogal (Bethesda Research 
Labs) • A portion of the ligation reactions was used to 
transform cell R12 XLl blue coiBpetent cells 
(Stratagene Catalogue Nu&ber: 200236) and then the 
transforaants vere selected on LB plates containing 50 
10 aicrograas per ml aaqpicillin. Nhite colonies vere 
selected and plasaid ainl preps vere prepared for Wh 
digestion and tar OKA sequence analysis. Selected clones 
vere retested to determine if their insert DMA Hybridised 
vitb the G6F probes. 

15 VT- DMA aeoueneina 

Double stranded plasmid miA templates vere 
lur^Mured from 5 ml cultures according to standard 
protocols. Sequencing vas by the dideoxy «hain 
termination method using Sequenase 2.0 and a 

20 dideoxynucleotide sequencing kit (US Biochemical) 

according to the manufacturers protocol (a modification 
of Sanger et el. FKM; USA 2i<5463 (1977)]. 
Alternatively^ sequencing vas done in a DMA thermal 
cycler (Perlcin Blmer, model 4800) using a cycle 

25 sequencing kit (Mev England Biolabs; Bethesda Research 
Laboratories) and vas performed according to 
manufacturers instructions using a S'-end labelled 
primer. Sequence primers vere either those supplied 
vith the sequencing kits or vere synthesised according to 

30 sequence determined from the clones. Sequencing 

reactions vere loaded on and resolved on 0.4mm thick 
sequencing gels of 6% polyacrylamide. Gels vere dried 
and exposed to X-Ray film. Typically, 35S vas 
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incorporated vben standard aequencing kits vere used and 
a 32P and labelled. priAer vas used for cycle sequencing 
reactions. Sequences vere read into a Wh sequence 
editor froa the bottom of the gel to the tpp (5^ 
5 direction to 3') and data vere analysed using programs 
supplied by Genetics Conputer Group (GOG, university of 
Wisconsin) • 



sola BWA Preparation and PGR Annlif ieation 

Open reading franes detected in the genomic DNA 

10 and vfaich contained sequence encoding GGP peptides vere 
extended via PCR asvlif ication of pituitary RHA. RNA vas 
prepared from frosen bovine tissue (Pelfreese) according 
to the guanidine neutral*CsCl procedure (Ghirgvin et. al. 
Biochemistry 18:5294(1979).) Polyadenylated miA vas 

15 selected by oligo-dT cellulose column chromatography 
(Aviv and Iisder FNAS (USA) 69:1408 (1972)). 

Specific roiA target sequences vere amplified 
beginning vith either total RKA or polyadenylated RKA 
samples that had been converted to cDNA using the Perlcin 

20 Elmer PCR/miA Kit Number: N808-0017. First strand 

reverse transcription reactions used 1 ^g template RMA 
and either primers of oligo dT vith restriction enzyme 
recognition site linkers attached or specific antisense 
primers determined from cloned sequences vith restriction 

25 sites attached. To produce the second strand, the 

primers either vere plus strand unique sequences as used 
in 3' RACB reactions (Prohnan et. al., PNAS (USA) £5:8998 
(1988)) or vere oligo dT primers vith restriction sites 
attached if the second target site had been added by 

30 terminal transferase tailing first strand reaction 

products vith dATP (e.g. 5' race reactions, Frohman et. 
al., ibid). Alternatively, as in anchored PGR reactions 
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thm sec nd strand prim rs ware dag nerat , hance, 
raprasanting particular peptlda saquancas. 

Tlia aapllf ioatlon prof lias follovad the following 
ganaral sdhana: 1) flva alnutas soak flla at 95«C} 2} 
5 thanuil eycla flla of 1 alnuta, 95*C; 1 alnuta raapad 
dovn to an annaallng taaparatura of 45*C, SO*C or 55*C; 
maintain tha annaallng taoperatura for ona mlnuta; ra^ 
up to 72 ovar ona minuta; axtand at 72 *C for ona minuta 
or for ona alnuta plus a 10 sacond auto axtanslon; 3) 

10 axtanslon cycla at 72*C, flva alnutas, and; 4) soak flla 
4*0 for inflnlta tiaa. Tharaal eycla fllas (#2) usually 
vara ruA for 30 i^las. A slictaan Ml salvia of aath 100 
til aaplif ication raaction vas analyzed by alaetrophorasls 
in 2% Rusiava 1% agarose gals run in TAB buffer at 4 

15 volts per centlaeter for three hours. The gals vara 

stained, then blotted to uncharged nylon asabranas vhltih 
vere probed vith labelled DMA probes that vera internal 
to the priaers. 

specific sets of DMA aaplif ication products could 

20 be identified in the blotting experiaents and their 
positions used as a guide to purification and 
reaapllf ication. lihen appropriate, the reaaining 
portions of selected saqples vere loaded onto preparative 
gels, then follovlng electrophoresis four to five slices 

25 of 0.5 aa thitikness (bracketing the eicpected position of 
the specific product) vere taken froa the gel. The 
agarose vas crushed, then soaked in 0.5 al of 
electrophoresis buffer from 2-16 hours at 40«C. The 
crushed agarose vas centrlfuged for tvo ainutes and the 

30 aqueous ^lase vas transferred to fresh tubes. 

Reaaqplif ication vas dona on five aicroliters 
(roughly 1% of the product) of the eluted aatarial using 
the same sets of priaers and the reaction profiles as in 
the original reactions. When the reaaplif ication 
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reactions vere conpleted, saaples were extracted vlth 
cfalorofon and transferred to fresh tubes. Concentrated 
restriction ensyse buffers and enxyaes were added to the 
reactions in order to cleave at the restriction sites 
5 present in the linkers. Thm digested PGR products were 
purified by gel electrophoresis, then subcloned into 
vectors as described in the subcloning section above. 
DHA sequencing was done described as above. 



10 DRA sequences vere asseabled using a fragment 

assembly program and the amino acid sequences deduced by 
the GOG programs GelAssemble, Hap and Translate. The 
deduced protein sequences vere used as a query sequence 
to seartih protein sequence databases using WordSearch. 

15 Analysis vas done on a VAX Station 3100 vorkstation 

operating under VMS 5.1. The database search vas done on 
SvissProt release number 21 using GCG Version 7.0. 

TX. Riigulta of Cloning and Samiencina of cenes encoding 

20 AS indicated above, to identify the DMA sequence 

encoding bovine GGF-ZI degenerate oligonucleotide probes 
vere designed from GGF-IZ peptide sequences. GGF-ZI 12 
(SEQ ZD Ho. 44) , a peptide generated via lysyl 
endopeptidase digestion of a purified GGF-ZZ preparation 

25 (see Figs. 16 and 12) shoved strong amino acid sequence . 
homology vith GGF-Z 07 (SBQ ZD No. 39) , a tryptic peptide 
generated from a purified GGF-Z preparation. GGF*ZZ 12 
vas thus used to create ten degenerate oligonucleotide 
probes (see oligos 609, 610 and 649 to 656 in Fig. 20, 

30 SBQ ZD Mos. 66, 67, 68 and 75, respectively). A 

duplicate set of filters vere probed vith tvo sets (set 
1-609, 610; set 2«649-5656) Of probes encoding tvo 
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ovwr lapping portions of G6F-ZI 12. Hybridisation signals 
vers observed, bat, only one clone hybridized to both 
probo sets. The done (designated 66F2B61) vas purlflsd. 

Southern blot analysis of DMA froB ths phags done 
5 GGF2B61 eonf ixBsd that both sets of probes hybsidlssd 
vlth that bevlns ssqusnes, and showsd furtbar that 
both probes reacted with the saae set of OHK fragnents 
vlthln the clmie. Based en those experloents a 4 kb Boo 
SX sub^fragsent of the original done vas Identified, 

10 subclened and partially sequenced. Pig. 21 shews the 

nndaetide seqaenee, SBQ ZD Ro. 99) and tha dsduead aalno 
acid saqusnoe of the Initial DUX sequenoe readings that 
Included ths IqrteldlBatlon sites of probes 609 and 650, 
and oonf InMd that a portion of this bovine ganoale DHA 

IS encoded peptide 12 (KASIADSGEUI) . 

Further eequenee analysis desonstrated that 06F-ZZ 
12 rosited en a 66 aaino add ^on reading fraae (see 
bdow) idilCh has beeoM the starting point for the 
isolation of overlapping sequences representing a 

20 putative bovine 66F-II gene and a cDHX. 

Several PCR procedures were used to obtain 
additional coding sequences for the putative bovine 
G6F-ZI g«M. Total mUk and oligo dT-adeeted (poly A 
containing) RHA saaplas were prepared fro» bevina total 

25 pituitary, anterior pituitary, posterior pituitary, and 
bypothalanus. Using prlners frea the list shown in Fig. 
22, SSQ ID Hos. 109-119, one-slded PCR reactions (RACE) 
were used to amplify cDMA ends in both the 3' and 5' 
diractiwis, and an^orad PCR reactions were perfened 

30 with degenerate oligonucleotide prlMrs representing 
additional GCF-ll peptides. Fig. 29 suanarlxos the 
contiguous DRA structures and sequences obtained in those 
experlnents. From the 3' RACE reactions, three 
alternatively spliced cIMA sequences vers produced. 
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%diieh have been cloned and s quenc d. A 5' RACE 
reacrtiori led to the diecovery of an additional; exon 
containing coding oeqpience for at least 52 amino acids. 
Analysis of that deduced anino acid sequence revealed 
5 peptides GGF-IZ-6 and a sequence sinilar to 66P«Z*18 (see 
belov) • The anchored PGR reactions led to the 
identification of (cDHA) coding sequences of peptides 
GGF-ZZ-1, 2, 3 and 10 contained within an additional cDMA 
segaent of 300 bp. Vhe S' liait of this segnent (i.e., 

10 segment B, see Fig. 30) is defined hy the oligonucleotide 
yhidh encodes peptide G6F-II-1 and lAiich was used in the 
PGR reaction (additional 5' sequence data exists as 
described for the human clone in Bxample 11) • Ibus this 
clone contains nucleotide sequences encoding six out of 

15 the existing total of nine novel G6F-IZ peptide 
sequences. 

The cl«Md gene vas characterized first by 
constructing a i^ysical map of G6F2B61 that alloved us to 
position the coding sequences as they were found (see 

20 belov. Fig. 30) • DNA probes from the coding sequences 
described above have been used to identify further DMA 
fragments containing the exons on this phage clone and to 
identify clones that overlap in both directions. The 
putative bovine GGF-ZI gene is divided into at least 5 

25 coding segments. Coding segments are defined as discrete 
lengths of DRA sequence lAich can be translated into 
polypeptide sequences using the universal genetic code. 
The coding segments described in Fig* 36 and referred to 
in the present application are: 1) particular exons 

30 present vithin the GGF gene (e.g. coding segment a), or 
2) derived from sets of tvo or more exons that appear in 
specific sub-groups of ARHAs, where each set can be 
translated into the specific polypeptide segments as in 
* the gene products shown. The polypeptide segments 
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r t rred to In tb clalas are the translation products of 
the analogous DNA coding segments. Only coding segnents 
A and B have been defined as axons and sequenced and 
aapped thus far. The suasary of the contiguous coding 
5 sequences Identified Is shown In Pig. 31. The axons are 
listed (alphabetically) In the order of their discovery. 
It Is apparent froa the Intron/exon boundaries that axon 
B nay be Included In cDHAs that connect coding segaent E 
and coding segaent A. That Is, axon B cannot be spliced 

10 out vlthout caaproalslng the reading fraae. Therefore, 
we suggest that three alternative splicing patterns can 
produce putative bovine G6P-II cDNA sequences 1, 2 and 
3. The coding sequences of these, designated 
66F2BFP1.CDS, 66F2BPP2.CDS and GGF2BPP3.CDS, 

IS respectively, are given In Pigs. 27A (SBQ ID No. 129), 
27B (SBQ ID No. 130) , and 27C (SBQ ID No. 131) , 
respectively. The deduced aalno add sequence of the 
three cOMAs Is also given In Pigs. 27A, (SBQ ID No. 129) , 

27B (SBQ ID No. 130) , and 270 (SBQ ID No. 131) • 
20 The three deduced structures encode proteins of 

lengths 206, 281 and 257 aalno acids. The first 183 
residues of the deduced protein sequence are Identical in 
all three gene products. At position 184 the clones 
differ significantly. A codon for glycine GOT In 
25 GGP2BPP1 also serves as a splice donor for G6P2BPP2 and 
66P2BPP3, which alternatively add on axons C, C/D, C/D' 
and D or C, C/D and D, respectively, and shown In Pig. 
32, SBQ ID No. 145). G6PIIBPP1 is a truncated gene 
product irtildh Is generated by reading past the coding 
30 segaent A splice junction Into the following Intervening 
sequence (Intron) . This represents coding segaent A' in 
Pig. 30 (89Q ID No. 136). The transcript ends adjacent 
to a canonical AATAAA polyadenylation seqpience, and we 
suggest that this truncated gene product represents a 
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bona fids nature tranacrlpt. Tha other tvo longer gene 
products shars ths sans 3' untranslated ssquenee and 
polyadenylation sits. 

All thrss of these aoleeules contain six of ths 
5 nins novsl GGF-ZZ psptids ssqusncss (sss Fig. 11) and 
another peptide is highly homologous to t»SF-I-18 (sss 
Fig. 26) • This finding givss a high probability that 
this reconbinant aolecule encodes at least a portion of 
bovins G6F-IZ. Furthsmors^ ths ealculatsd isoelectric 

10 points for ths three peptides ars consistsnt vith ths 

physical propsrtiss of GGF*-I and II. Since the molecular 
sizs of G6F*II is roughly 60 kD, ths longsst of ths three 
cDKAs should oncods a protsin vith nsarly one-half of ths 
predicted number of amino acids. 

15 A probe enco^ssing ths B and A Mons vas 

labslled via PCR amplification and used to scrsen a cl»A 
library made from RMA isolatsd from bovins posterior 
pituitary. One clone (G6F2BFF5) shovsd the pattern 
indicated in Fig. 29 and contained an additional OKA 

20 coding ssgmsnt (G) bstvesn coding ssgments A and c. Vhs 
sntire nucleic acid ssgusncs is shown in Fig. 31 (SEQ ZD 
No. 144) . Ths prsdictsd translation product from ths 
longest open reading frame is 241 amino acids* A portion 
of a sscond cDNA (GGF2BPP4) vas also isolated from ths 

25 bovine posterior pituitary library using the probe 

described above. This clons shoved the pattern indicated 
in Fig. 29. This clons is incomplste at the 5^ end, but 
is a splicing variant in ths ssnss that it lades coding 
segments 6 and D. BPP4 also displays a novel 3' end vith 

30 regions H, K and L beyond region C/D. The sequence of 
BPP4 is shovn in Fig. 33 (SEQ ID No. 146) . 
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The G6F proteins are th veabers of a nev 
superfasily of proteins. In high stringency cross 
hybridisation studies (DHA blotting eaq^iaents) vith 
other Munaalian ORAs ve have shovn, clearly, that SHA 
S probes from this bovine reeoabinant molecttle can readily 
detect specific sequences in a variety of sas^les tested. 
A highly hosologous sequence is also detected in hunan 
genomic DKA. The autoradiograa is shovn in Fig. 28. The 
signals in the lanes containing rat and human OKA 
10 represent the rat and human equivalents of the <SGF gene> 
the sequences of several cDNA's encoded by this gene have 
been recently reported by Holmes et al. (Science 2£fis 
1205 (1992)) and Wen et al. (Oell £i: 559 (1992)). 



EXMgLB 12 

15 ireiHtion Qi a Huaan gggumgft Bnff9ainq Hmnn fifira 

Several human clones containing sequences from the 
bovine GOFZI coding segment B were isolated by screening 
a human clMIA library prepared from brain stem (Stratagene 
catalog #935206) . This strategy vas pursued based on the 

20 strong link between most of the 6GF2 peptides (unique to 
G6F2) and the predicted peptide sequence from clones 
containing the bovine B segment. This library tras 
screened as described in Bxaaple 8, Section ZZ using the 
oligonucleotide probes 914-*919 listed belov. 

25 914T06G6CTCCATGSAAGAA6ATGTA (SBQ ZD NO: 179) 

915TCCATGAAGAAGATGTACCT6CT (SBQ ZD NO: 180) 

916AT6TACCT6CT6TCCTCCTT6A (SBQ ZD NO: 181) 

917TT6AAGAAG6ACT06CT6CTCA (SBQ ZD NO: 182) 

30 919ATGAR6IGTGG606GCGAAA (SBQ ZD NO: 184) 

Clones detected vith these probes vere further 
analysed by hybridization. A probe derived from coding 
segment A (see Fig. 30) , vhich vas produced by labeling a 
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polymerase chain reaction (PGR) product from e gment A, 
was also used to screen the priaary library, several 
clones that hybridized vith both A and B derived probes 
vers selected and one particular clone ^ GGF2HBS5, vas 
5 selected for further analysis. This clone is represented 
by the pattern of coding segaents (EBACC/D'O as shovn in 
Fig. 30) , The B segnent in this clone, is the hunan 
equivalent of the truncated bovine version of B shovn in 
Fig. 30. 00F2HBSS is the Bost likely candidate to encode 

10 6GF-ZZ Of all the "putative" C6F-II candidates described. 
The length of coding sequence segnent B is 786 
nucleotides plus 264 bases of untranslated sequence. The 
predicted sise of the protein encoded by G6F2HB85 is 
approxiaately 423 aaino acids (approxiaately 45 

15 kilodaltons, see Fig. 44 p SBQ ZD HO: 21), vhich is 

sisilar to the sise of the deglycosylated f ora of GGF-II 
(see Bxaaple 20). Additionally, seven of the GGF-II 
peptides listed in Fig. 26 have equivalent sequences 
%ihich fall vithin the protein sequence predicted froa 

20 region B. Peptides II-6 and 11-12 are exceptions, vhich 
fall in coding segaent B and coding segaeat A, 
respectively. SNA encoding the 66F2HBS5 protein vas 
produced in an in vito-o transcription systea driven by 
the bacteriophage T7 proaoter resident in the vector 

25 (Bluescript SK [Stratagene Inc.] see Fig. 47) containing 
the G6F2BBS5 insert. This RKA vas translated in a cell 
free (rabbit reticulocyte) translation systea and the 
sise of the protein product vas 45 Xd. Additionally, the 
cell--free product has been assayed in a Schwann cell 

30 aitogenic assay to confira biological activity. Schvann 
cells treated vith conditioned aediua shov both increased 
proliferation as aeasured by incorporation of ^^'l-Uridine 
and ^osphorylation on tyrosine of a protein in the 185 
kilodalton range. 
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ThuB the flisa f the product ancod d by GGF2HBS5 
and tha praaanca of DKA aaquaneaa vhich aneoda hunan 
paptidas highly ho&ologoua to tha bovlna paptldaa shown 
in Fig. 11 conf in that G6F2BBS5 anoodaa tha human 
5 aqoivalant of bovlna G6P2. Iha faot that oonditionad 
aadia praparad f roa calls transformad with this clona 
allcits Schwann coll adtogenic activity conf irns that tha 
G67ZIHBS5 gana produca (unlika tha BPP5 gana product) is 
sacratad. Additionally tha GGFIIBPP5 gana product saana 
10 to Mdiata tha Schwann call prolifaration rasponsa via a 
racaptor tyrosina kinasa sudh as plSS**^ or a closaly 
ralatad racaptor (saa Bxai^la 19) « 

BMgLE la 

gmBraaaloi^ BmBm BflgflflM'^'*'^^ UMmrn^Mftn unit 

15 Inaact Calla 

Tha 66P2HBS5 eDMA clona ancoding human GGTZ (as 
daseribad in Bxanpla 12 and also raf arrad to harain as 
BBSS) was clonad into vaetor pcDL-SRa296 and OOS-7 calla 
wara transf acted in 100 am diahas by tha OBAB-daxtran 

20 method. Cell lysatas or conditioned media from 

transiently expressing COS cells were harvested at 3 or 4 
daya post-transf action. To prepare lysates, cell 
monolayers ware washed with PBS, scraped from the diahas 
lysad by three fraasa/thaw cycles in ISO in of 0.25 M 

25 Tris-BCl, pBS. Call debris was pelleted and the 

aupamatant recovered. Conditioned media samples (7 
xftla.) ware collected, then concentrated and buffer 
exohangad with 10 am Trie, pH 7.4 using Centiprep-lO and 
Cantricon*lO units as deacribad Iqr tha manufactures 

30 (Aaicon, Beverly, Mk) . Rat nerve Schwann calls ware 

assayed for incorporation of WSIK aynthesis precursors, as 
described. Conditioned media or cell lysate samples were 
tested in the Schwann cell proliferation assay as 
described in Narchionni at al., Hature 362:313 (1993). 
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The CDMA, G6P2RBS5, encoding GGF2 directed the secretion 
of the protein product to the aediua. Kiniaal activity 
was detectable inalde the eella as deteralned by aasaye . 
using cell lysates. GGF2RFB1 and 6GFBPP5 cONa's failed 
5 to direct the secretion of the product to the 

extracellular aediua. GG7 activity froa these clones vas 
detectable only in cell lysates* 

Reeoobinant G6F3 vas also expressed in CHO cells » 
The GGP2RBS5 dOKh encoding G6P2 was cloned into the BeoRZ. 

10 site of vector pcdhfrpolyA and transfected into the DBFR 
negative CHO cell line (G644) by the calciua phosphate 
coprecipitation nethod. Clones vers selected in 
nucleotide and nucleoside free a aediua (Gibco) in 96- 
veil plates. After 3 weeks, conditioned aedia saaples 

15 froa individual clones were screened ^or laxpression of 
G6P by the Schwann cell proliferation assay as described 
in Karc^ionni at al.. Nature 362:313 (1993). Stable 
clones which secreted significant levels of GG7 activity 
into the aediua were identified. Schwann cell 

20 proliferation activity data froa different voluae 
aliquota of CHO cell conditioned aediua were uaed to 
produce the dose response curve shown in Fig. 46 (Grahaa 
and Van Dar Bb, Virology 52:456, 1973). This natarial 
vas analysed on a Western blot probed with polyclonal 

25 antisera raised against a G6F2 specific peptide. A band 
of approxiaately 65 Kd (the expected size of GGF2 
extracted froa pituitary) is specifically labeled (Fig. 
48, lane 12). 

Recoabinant GGF2 was also expressed in insect 

30 cells using the Baculovirus expression. Sf9 insect cells 
were infected with baculovirus containing the GGP2HB85 
cDNA clone at a aultiplicity of 3-5 (10^ cells/al) and 
cultured in Sf 900-ZZ aediua. Schwann cell aitogenic 
activity waa sscreted into the extracellular aediua. 
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Different volumes of Insect cell conditioned aediim were 
tested in the Schwann cell proliferation assay in the 
absence of forskolln and the data used to produce a dose 
response curve* 

5 Zbls aaterial was also analyzed on a Western blot 

(Fig. 45B) probed with the G6F ZZ specific antibody 
described above. 

The methods used in this exaiple vere as follows: 
Schwann cell aitogenic activity of recombinant 
10 human and bovine glial growth factors was determined as 
follows: Mitogenic responses of cultured Schwann cells 
were measured in the presence of 5 |iN forskolln using 
crude recombinant 66P preparations obtained from 
transient mammalian expression experiments. 
15 Zncorporation of ["^Ij-Utd was determined following an 
lS-24 hour exposure to materials obtained from 
transf acted or mode transf acted cos cells as described in 
the Itothods. The mean and standard deviation of four 
sets of data are shown. The mitogenic response to 
20 partially purified native bovine pituitary 66P 

(carboxymethyl cellulose fraction; Goodearl et al., 
submitted) is shown (66F) as a standard of one hundred 
percent activity. 

cnas (Fig. 46, SBQ ZO NOs. 166-168) were cloned 
25 into pcDli*SRa296 (Talcebe et al., Mol. Cell Biol. 8s466* 
472 (1988)), and OOS-7 cells were transf acted in lOO mm 
dishes by the DEAB-dextran method (Sambrook et al.. In 
Jiroieeuiar Cloning. Jl Laboratory Manual^ 2na. ad. (Cold 
Spring Barbor Laboratory Press, Cold Spring Harbor, NY, 
30 1989) )^. cell lysates or conditioned media were harvested 
at 3 or 4 days post-transf action. To prepare lysates, 
cell monolayers were washed with PBS, scraped from the 
dishes, and lysed by three freeze/ than cycles in 150 Ml 
of 0.25 M Tris-HCl, pH 8. Cell debris was pelleted and 
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the 8Upernat« recovered. Conditioned media eapples (7 
ale) vere collected, then concentrated and buffer 
exchanged with 10 BH Trie, |^ 7.4 ueing Centriprep^io and 
Centricon*10 units are described by the nanufaeturers 
5 (JkMiconi Beverly, MX) • Rat sciatic nerve Schwann cells 
were assayed for incorporation of DMA synthesis 
precursors, as described (Davis and Stroobant, J. Cell 
Biol. 110:13S3-1360 (1990); Broclces et al.. Brain Res. 
165:105-118 (1979)). 
10 western blot of reconbinant CBO cell conditioned 

m 

BediuB were perfomed as follows: A reconbinant CBO 
clone was cultured in MCDB302 protein-free for 3 days. 2 
ml of conditioned nediuB was harvested, concentrated, 
buffered exchanged against 10 aN Tris-BCl, 7.4 and 

15 lyophilised to dryness. Ths pellet was resuspended in 
SD8-PA6B sapple buffer, subjected to reducing SDS gel 
electrophoresis and analysed by Western blotting with a 
GGF peptide antibody. A CBO control «ras done by using 
conditioned aediuB from untransfected CB0-DG44 host and 

20 the CBO BBSS levels were assayed using conditioned nediun 
from a reooabinant clone. 

EXMglig 14 

Idtntifioetlon of rungtional Eloaonts of ggr 

The deduced structures of the f aaily of 06P 
25 sequences indicate that the longest foras (as represented 
by G6F2BPP4) encode transaeabrane proteins where the 
extracellular part contains a doaain which reseables 
epideraal growth factor (see Carpenter and Wahl in 
Peptide Growth Factors and Their Receptors I pp. 69-133, 
30 Springer-Verlag, NY 1991) . The positions of the cysteine 
residues in coding segaents C and C/D or C/D' peptide 
sequence are conserved with respect to the analogous 
residues in the epideraal growth factor (E6F) peptide 
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sequenc (0 a Pig. 32, SEQ ID Koo. 147-149). This 
suggests tlmt the extracellular donaln functions as 
receptor recognition and biological activation sites. 
Several of the variant forms lack the H, K, and L coding 
S segments and thus may be expressed as secreted^ 
diffusible biologically active proteins. 66P VHK 
sequences encoding polypeptides which encoaqpass the EOF* 
like domain (B6PL) can have full biological activity for 
stimulating glial cell mitogenic activity. 

10 membrane bound versions of this protein may induce 

mciivann cell ^oliferation if expressed on the surface of 
neurons during embryogenesls or during nerve regeneration 
(irtiere the surfaces of neurons are intimately aesociated 
vith the surfaces of proliferating Schwann cells) • 

15 Secreted (non membrane bound) GGPs may act as 

classically diffusible factors which can interact vith 
Schwann cells at seme distance from their point of 
eeoretion. Other forms may be released from intraeells 
by sources via tissue injury and cell disn^ion. An 

20 exaq^le of a secreted G6F is the protein encoded by 

66P2RBS5; this is the only 66F known which has been found 
to be directed to the exterior of the cell* Secretion is 
probably mediated via an N-terminal hydro|diobie ssquence 
found only in region E, whieh is the H-terminal domain 

25 contained within recoiribinant GGF2 encoded by G6F2HBS5. 

Other GGF's appear to be non-secreted. These OGFs 
may be injury response forms which are released as a 
consequence of tissus damage. 

Other regions of the predicted protein structure 

30 of GGF2 (encoded by G6F2HBS5) and other proteins 

containing regions B and A exhibit similarities to the 
human basement membrane heparan sulfate proteoglycan core 
protein. The peptide ADSGEY, which is located next to 
the second cysteine of the C2 Immunoglobulin fold in 
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these QGFfB, oeeure in nine of tventytvo C-2 repeats 
found in that basal lamina protein. This evidence 
strongly suggests that these proteins may associate vith 
matrix proteins such as those associated vith neurons and 
5 glia, and may suggest a method for sequestration of glial 
growth factors at target sites • 

aawpiiB ig 

Purlf Icatton 9t ggrg tm ReggHblmtnt CglU 

In order to obtain full length or portions of GGPs 

10 to assay for biological activity, the proteins can be 

overproduced using cloned DNA. Several approaches can be 
used. A recombinant £. fioH cell containing the 
sequences described above can be constructed* Expression 
systems such as pMHOa or pHHlSa (Stratagene, Inc.) can be 

15 used for this purpose by folloving manufacturers 
procedures. Alternatively, these sequences can be 
inserted in a mammalian expression vector and an 
overproducing cell line can be constructed. As an 
example, for this purpose DNA encoding a G6F, clone 

20 G6F2BPP5 has been expressed in COS cells and can be 

expressed in Chinese hamster ovary cells using the pHSXHD 
expression vector (Lee and Nathans, J. Biol. Chem. 2fi2f 
3521-3527, (19S1) } . This vector containing G6F OHA 
sequences can be transfected into host cells using 

25 established procedures. 

Transient expression can be examined or 
6418*resistant clones can be groim in the presence of 
methotrexate to select for cells that amplify the dhfr 
gene (contained on the pMSXND vector) and, in the 

30 process, co-amplify the adjacent GGF protein encoding 
sequence. Because CHO cells can be maintained in a 
totally protein*free medium (Hamilton and Ham, In Vitro 
11, 537-547 (1977)), the desired protein can be purified 



FCT/US94/aS0S3 



- 71 - 

from the aadiuB. Hastern analysis using the antisera 
produced in Bxample 17 can be used to detect the presence 
of the desired protein in the conditioned aediua of the 
overproducing cells. 
S Thm desired protein (r66F2) was purified from the 

mediUB conditioned by transiently expressing cos cells as 
follovs. rGGF II vas harvested froa the conditioned 
aediuB and ]>artially purified using Cation Exchange 
Chromatography (POROS^HS). She column was equilibrated 

10 vith 33.3 «K MBS pB 6.0. Conditioned media vas loaded at 
flov rate of 10 nl/mln. She peak containing Stihvann cell 
proliferation activity and imnunoreactive (using the 
polyclonal antisera vas against a G6F2 peptide described 
above) vas eluted with 50 bM Trie, IK NaCl pB 8.0. 

15 rhGGR is also expressed using a stable Chinese 

Ovary Bamster cell line. rGGF2 from the harvested 
conditioned media vas partially purified using Cation 
Exchange Chronatograph (POROS*BS). The column vas 
equilibrated vith PBS pB 7.4. conditioned madia vas 

20 loaded at 10 ml/min. The peak containing the Schwann 
Cell Proliferative activity and imnunoreactivity (using 
CSGF2 polyclonal antisera) vas eluted vith SO mM Bepes, 
500 bM NaCl pB 8.0. An additional peak vas observed at 
50 «K Bepes, IM NaCl pH 8.0 vith both proliferation as 

25 veil as immunoreactivity (Pig* 45) • 

rhG6F2 can be further purified using Bydrophobic 
Interaction Chromatography as a high resolution step; 
Cation exchange/Reserve phase Chromatography (if needed 
as second high resolution step); A viral inaetivation 

30 step and a DBA removal step such as Anion exchange 
chromatography. 

Schvann Cell Proliferation Activity of recombinant 
GGF2 peak eluted from the Cation Exchange column vas 
determined as follovs: Nitogenic responses of the 
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cultured Sohimiin cells were measured in the presence of 5 
M Porskblin using the peak eluted by 50 AM Trie 1 N NaCl 
pH 8.0. The peak vas added at 20 l, 10 1 (1:10) 10 1 and 
(1:100) 10 !• Incorporation of "^I-Oridlne was 
5 determined aitf ex|n:essed as (CPM) folloving an 18-24 hour 
exposure. 

An iasnuioblot using polyclonal antibody raised 
against a peptide of 6GP2 vas carried out as follows: 10 
1 of different fractions were ran on 4-12% gradient gels. 

10 n&e gels were transferred on to Nitrocellulose paper, and 
the nitrocellulose blots were blocked with 5% BSk and 
probed with GGP2-specific antibody (1:250 dilution). ^^^I 
protein A (1:500 dilution. Specific Activity - 9.0/Ci/g) 
was used as the secondary antibody. The ioBunoblots were 

15 esvosed to Xodax X-nay films for 6 hours. The peak 

fractions eluted with 1 M NaCl showed an iasninoreactive 
band at 69X. 

06P2 purification on cation exchange columns was 
performed as follows: CHO cell conditioned media 

20 expressing rGGFII was loaded on the cation exchange 
column at 10 ml/nin* The column was equilibrated with 
PBS 1^ 7.4. The elution was achieved with 50 tfK Hopes 
500 mN NaCl pH 8.0 and 50 bH Hopes IN NaCl pH 8.0 
respectively. All fractions were analysed using the 

25 Schwann cell proliferation assay (CFN) described herein. 
The protein concentration (ng/ml) was determined by the 
Bradford assay using BSA as the standard. 

A Western blot using 10 4 of each fraction was 
performed and immunoreactivity and the Schwann cell 

30 activity were observed to co-*nigrate. 

The protein may be assayed at various points in 
the procedure using a Western blot assay. Alternatively, 
the Schwann cell mitogenic assay described herein may be 
used to assay the expressed product of the full length 
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cl n or any bi logically active p rtions ther of* The 
full length clone 6GP2BPP5 has been expressed transiently 
in COS cells* Intracellular extracts of transf acted COS 
cells shov biological activity vhen assayed in the 
5 Sdnfann cell proliferation assay described in Bxa^>le 8. 
In addition, the full length close encoding G0F2BBS5 has 
been expressed transiently in COS cells, in this case 
both cell extract and conditioned aedia shov biological 
activity in the Sch%iann cell proliferation assay 

10 described in Exaaple 8. Any neaber of the faaily of 

splicing variant coqpleaentary UtJk's derived froa the G6P 
gene (including the Heregulins) can be eivressed in this 
nanner and assayed in the Schwann cell proliferation 
assay by one skilled in the art. 

15 Alternatively, recombinant material aay be 

isolated fron other variants according to Wen et al. 
(Cell ^:559 (1992)) who expressed the splicing variant 
Meu differentiation factor (NOP) in 00S*7 cells* «DliA 
clones inserted in the pJT-2 eukaryotic plasnid vector 

20 are under the control of the SV40 early promoter, and are 
3'«flanked with the SV40 termination and polyadenylation 
signals* COS-7 cells were transf acted with the pJT*2 
plasmid DMA by eleotroporation as follows: 6 x 10^ cells 
(in 0.8 ml of DilBM and 10% PEBS) were transferred to a 

25 0*4 em cuvette and mixed with 20 |ig of plasmid DMA in 10 
Ml of TB solution (10 aM Tris-HCl (pH 8*0), 1 AN EDTA)* 
Eleotroporation was performed at room temperature at 1600 
V and 25 fiT timing a Bio-Rad Gene Pulser apparatus with 
the pulse controller unit set at 200 ohms* The cells 

30 were then diluted into 20 ml of OMEN, 10% PBS and 

transferred into a T75 flask (Palcon) * After 14 hr. of 
incubation at 27 •C, the medium was replaced with DMEM, 1% 
PBS, and the incubation continued for an additional 48 
hr« Conditioned medium containing recombinant protein 
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Which vas harvested from the cells deaonstrated 
biological activity in a cell line eaepressing the 
receptor for this protein. This cell line (cultured 
huaan breast carcinoma cell line AU S65) was treated vith 
5 reconbinant eater ial. The treated cells exhibited a 
morphology change which is characteristic of the 
activation of the erbB2 receptor. Conditioned medium of 
this type also can be tested in the Schwann cell 
proliferation assay* 

ii^fc^inal sequence analysis 
The cmiA encoding hG6F2 was cloned into the 
a^lif iable vector pcdhfrpolyA and transfected into CBO- 
D644 cells for stable expression. rhG6F2 is secreted 

15 into the conditioned media. The ability of the 

recombinant GGF2 to be secreted is presumably mediated 
through the N-^terminal hydrophobic stretch (signal 
sequence) . A signal sequence, once having initiated 
eagport of a growing protein chain across the rough 

20 endoplasmic reticulum, is cleaved from the mature protein 
at a specific elte. H-terminal sequence analysis of the 
expressed and purified rhGGF2 indicates the site of 
cleavage as shown below. The sequence of the first 50 
amino acid residues at the N-terminus of the protein was 
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c nfiraed by H-teralnal sequence analysis (Table 5), 

belev. 

VABLM 8 

N-teninal sequence analysis of rtaG6F2 





Priaaxy 
Sequence 


pMoles 


1 


Cly (C) 


210.6 


2 


Asn (H) 


183 


3 


GLU (E) 


149 


4 


Ala (A) 


220 


5 


Ala (A) 


180 


6 


Pro (P^ 


173 


7 


Ala (A) 


177 


8 




154.9 


9 


Ala (A) 


162.4 


10 


8«r (8) 


65.4 


11 


val (V) 


132.7 


12 


val (V) *(cys) 


11.7 


13 


Tyr (Y) 


112.7 


14 


ser (8) 


47.6 


15 


Ser (8) 


27.1 



10 



15 



20 



•cys rasidttsa am testroy^d by tte Bdun DAgradatloii Wroemnm ud 
eaiiaot hm dmtmatmd 



25 The follevlng sequence (SEQ ID NO: 185) represents 

the amino aold sequence of hG672. The shaded area 
indicates the cleaved signal sequencoe 



30 



GNEAAPA6AS VCYSSPPSVC 8VQBLAQRAA VVIB6KVHPQ RRQQGALDRK 
AAAAA6EAGA NGGDRETOAA GntALGPPAE EPUAANGTV PSNPTAPVPS 
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AGBPGEEAPy LVRVRQVHilV KAGGLKXDSL LTVRLGTfiGH PAPPS06RIX 
BDSRYXPFifB PDANST8RAP AAFRASFPPL ETGRNLKKEV SRVLCKRCKL 
PPQLXENR8Q BSAJIGSKLVL RCETSSEYSS LRFKWFXNGN SUIRXIIKPQN 
IXIQKXP6XS ELRINKASLA DSGEYMCKVI SKLGNDSASA NITIVESNAT 
5 ST8TTGTSHL VKGAEKBRTF GVNGGSCFIIV KDLSNPSRYL CKCPNBFTGD 
RCQNYVKASF YSTSTPFLSL PE (SEQ ID HO: 185) 



Hhm shaded area represents eagperiAentally deterained 15 
amino acid residues at the H-terainal of the rhOGP2, 
indicating A5o*<Ssi hatiA to be the cleavage site for the 
10 signal sequence* 

aMgLB 17 

IflQlrtlon Qt a rurthtr Spllcliw Yeriant 

Methods for iqpdating other neuregulins descsribed 
in U.S. patent application Serial No. 07/965,173, filed 

15 October 23, 1992, incorporated herein by reference, 

produced four closely related sequences (heregulin a, ^1, 
fi2, fi3) irtiich arise as a result of splicing variation. 
Poles et al. (Cell <fi2:205 (1992)), and Wen et al. (cell 
£2:559 (1992)) have isolated another splicing variant 

20 (froB rat) using a siailar purification and cloning 
approach to that described in Exaaples 1-9 and 11 
involving a protein vhich binds to pl85*'^. The cDNA 
clone was obtained as follows (via the purification and 
sequencing of a plSS**^ binding protein from a 

25 transformed rat fibroblast cell line). 

A pl85*'^^ binding protein was purified from conditioned 
nedium as follows • Pooled conditioned mediua from three 
harvests of 500 roller bottles (120 liters total) vas 
cleared by filtration through 0.2 m filters and 

30 concentrated 31«>fold vith a Pelicon ultrafiltration 

system using membranes vith a 20kd molecular size cutoff. 
All the purification steps vers performed by using a 
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Pharaacia fast pr t in liquid chromatography system. The 
concentrated material wlb directly loaded on a column of 
heparin-sepharoae (150 ml, preequilibrated with 
phosphate-buffered saline (PBS)). The column was vashed 
5 vith PBS containing 0.2 M NaCl until no absorbance at 280 
nm wavelength could be detected. Bound proteins vere 
then eluted with a continuous gradient (250 ml) of HaCl 
(from 0*2 M to 1.0 N), and 5 ml fractions were collected. 
Samples (O.oi ml of the collected fractions were used for 

10 the quantitative assay of the kinase stimulatory 

activity. Aotive fractions from three column runs (total 
volume « 360 ml) were pooled, concentrated to 25 ml ky 
using a YMIO ultrafiltration membrane (Amicon, Oanvers, 
Mk) , and ammonium sulfate vas added to reach a 

15 concentration of 1.7 N. After clearance by 

oentrifugation (10,000 x g, IS min.), the pooled material 
was loaded on a ptaenyl-Siqperose column (BRlO/10, 
niarmaeia) . The column was developed with a 45 ml 
gradient of (104)3804 (from 1.7 M to no salt) in 0.1 M 

20 Na2P04 (pH 7.4), and 2 ml fractions were collected and 
assayed (0.002 ml per sample) for kinase stimulation (as 
described in 2xam>le 19) • The major peak of activity was 
pooled and dialysed against 50 mM sodium phosphate buffer 
(pR 7.3). A Nbno-S cation*exchange column (BR5/5, 

25 Pharmacia) was inreequilibrated with 50 mN sodium 

phosphate. After loading the active material (0.884 mg 
of protein; 35 ml) , the column was washed with the 
starting buffer and then developed at a rate of 1 ml/nin. 
. with a gradient of NaCl. The kinase stimulatory activity 

30 was recovered at 0.45-0.55 H salt and was spread over 
four fractions of 2 ml each. These were pooled and ' 
loaded directly on a Cu*^^ chelating columns (1.6 ml, HR2/5 
chelating Super ose, Pharmacia). Most of the proteins 
adsorbed to the resin, but they gradually eluted with a 
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30 al lin ar gradient f aamonlua chloride (0*1 K) • The 
activity eluted in a single peak of protein at the range 
of 0.05 to 0.2 M NB^Cl. Sanples froa various steps of 
purification were analysed by gel electrophoresis 
5 folloved by silver staining using a kit froa ZCH (Costa 
Mesa, CA), and their protein contents vere deterained 
with a Cooaassie blue dye binding assay using a kit froa 
Bio-Rad (Richaondf CK). 

The p44 protein (10 m9) vas reconstituted in 200 

10 Ml of 0.1 M aaaoniua bicarbonate buffer (pH 7.8) • 
Digestion vas conducted vith L*l-tosyl-aaide 
2«'phenylethyl chloroaethyl ketone-treated trypsin (Serve) 
at 37 ^C for 18 hr. at an enzyae-to-substrate ratio of 
1:10. She resulting peptide aixture vas separated by 

15 reverse^phase HPLC and aonitored at 215 na using a vydac 
C4 aicro coluan (2.1 aa i.d. x IS ca, 300 A) and an HP 
1090 liquid ohroaatographic systea equipped vith a 
diode*array detector and a vorkstation. The coluan vas 
equilibrated vith 0.1% trifluoroacetic acid (aobile phase 

20 A), and elution vas effected vith a linear gradient froa 
0t*55% aobile j^se B (90% aeetonitrile in 0.1% 
trifluoroacetic acid) over 70 ain* The flov rate vas 0.2 
al/ain. and the coluan teaperature %ras controlled at 
25 *C. One-third aliquots of the peptide peaks collected 

25 aanually froa the HPLC systea vere characterised by 

N-teralnal sequence analysis by Edaan degredation. The 
fraction eluted after 27.7 ain. (T27«7) contained aixed 
aaino acid sequences and vas further rechroaatographed 
after reduction as follovs: A 70% aliquot of the peptide 

30 fraction vas dried in vacuo and reconstituted in 100 $ul 
of 0.2 M eaaoniua bicarbonate buffer (pH 7.8) • DTT 
(final concentration 2 «N) vas added to the solution, 
vhich vas then incubated et 37«C for 30 ain. The reduced 
peptide aixture vas then separated by reverse-phase HPLC 



using a vydac coluan (2.1 mm i.d. x 15 cm) • Elution 
conditions and f lov rat'vere identical to those described 
above. A&ino acid sequence analysis of the peptide was 
perforaed with a Model 477 protein sequencer (Applied 
5 Biosysteas, Inc., Poster City, Ck) equipped vith en 

on-line phenylthiohydantoin (PIH) aaino acid analyser and 
a Model 900 data analysis systea (Bunkapiller et al. 
(1986) In Methods ot Protaln Mieroeharaetsrigation. J.E. 
Shively, ed. (Clifton, Nev Jersey: Huaana Press p. 223- 

10 247) « The protein was loaded onto a trifluoroacetic 
acid-treated glass fiber disc precycled vith polytarene 
and NaCl. Ihe PTH-aaino acid analysis was performed vith 
a Micro liquid diroaatography systea (Model 120) using 
dual syringe puqm and reverse-phase (C-18) narrov bore 

15 coluans (Applied Biosysteas, 2.1 aa x 250 aa)* 
RKA vas isolated f roa Rati«-EJ cells by standard 
procedures (Naniatis et al.. Molecular clonings A 
Laboratory Manual (Cold spring Harbor, Nov Yorlc (1982) 
and poly (A)^ was selected using an aiuiA separator Icit 

20 (Clonte^ Lab, Inc., Palo Alto, CA). cDNA vas 

synthesised vith the superscript kit (from BRL Life 
Technologies, Inc., Bethesda, ND) • Coluan-fractionated 
double-strand CDMA vas ligated into an Sail- and 
Notl-digested pJT-2 plasaid vector, a derivative of the 

25 pCD-x vector (COcayaaa and Berg, Mol. cell Biol. 1: 280 
(1983)) and transformed into imiOB E* fisli cells by 
eleotroporatlon (Dover et al., Nuel. Acids Rss. Ifi: 6127 
(1988) ) • Approxiaately 5 x lo' priaary transf oraants 
vere screened vith tvo oligonucleotide probes that vere 

30 derived frM the protein sequences of the N-terainus of 
HDP (residuiss 5-24} and the T40.4 tryptic peptide 
(residues 7-12) • Theix respective sequences vere as 
follovs (N indicates all 4 nt): 
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(1) 5'-ATA GG6 AA6 6GC G66 GGX AG6 GTC NCC GTC M6C 
AT 

AG6 6CC G66 CTT 6CC TCT 6GA GCC TCT*9' 

(2) 5'-lTT ACA CAT ATA TTC MCC-3' 
5 C 6 6 C 

111 SBQ ZD No. 1«3; 2: 8EQ ZD Mo. 164) 

The synthetic oligonuelMtldes were end^labaled with 
[Y-^^P]ATP with T4 polynucleotide kinase and used to 
screen replicate sets of nitrocellulose filters* Hie 

10 hybridization solution contained 6 x SSC, 50 sN sodiua 
phosphate (pB «*8), 0,1% sodiua pyrophosi^te, 2 x 
Denhardt's solution, 50 Mg/&1 salBon spam l»A, and 20% 
forBaaide (for probe l) or no foraanide (for probe 2). 
The filters were vashed at either 50«C vith O.S x S8C, 

IS 0.2% 808, 2 an EDTA (for probe 1) or at 37«C vith 2 x 

88C, 0.2% SD8, 2 aK BDTA (for probe 2). Autoradiography 
of the filters gave ten clones that hybridised vith both 
probes. These clones vere purified by replating and 
probe hybridisation as described above. 

20 The cDNA clones vere sequenced using an Applied 

Biosysteas 373A autooated DMA sequencer and J^plied 
Biosysteas Taq pyeDeoxy* Terainator cycle sequencing kits 
follovlng the aanufacture's instructions. In soae 
instances, sequences vere obtained using [''8]dATP 

25 (Aaershaa) and Bequenase* kits froa U.8. Biocheaicals 

folloving the aanufacturer's instructions. Both strands 
of ths cDMA clone 44 vere sequenced by using synthetic 
oligonucleotides as priaers. The sequence of the aost 5' 
350 nt vas deterained in seven independent cONA clones. 

30 The resultant clone deaonstrated the pattern shown in 
Fig. 27 (MDF). 



W0Ma«2M 



PCT/I)SM/IQS083 



- 81 - 

BXftWPM 19 

Purification and Aaaav at Ofchar Proflna whleh Mnd 

I. PurlfleatlBn iroao and p70 
5 iMpu at al. (Science 2i2« 1552 (1990)) end LlppBan 

and Lupu (patent aK>lloatlon nuaber PCT/U891/03443 
(1990) ) , hereby Incorporated by reference, have purified 
a protein from conditioned aedla of a huaan breast cancer 
cell line IIDAHIB-231. 

10 Liqpv et al. (Proe. lietl. Aoad. 8oi. fli, 2297 

(1992)) purified another protein fihLdh binds to the 
p]^S5*AB2 receptor. This particular protein, p75, was 
purified from conditioned aediua used for the growth of 
SXBr-3 (a huaan breast cancer cell line) j^ropagisited in 

15 iapreved Eagle's aediua (IMEM: 6ZBC0) suppleaented with 
10% fetal bovine swua (6ZB00). 



Peles et al. (Cell §2* 205 (1992)) have also 
purified a i85«»*** stiaulatlng ligand from rat cells. 
20 Holaes et al. (Science 1205 (1992)) have purified 
Heregulia a froa huaan cells which binds and stiaulates 

Bnaple 5). Tarakevsky et al. oncogene 
6:218 (1991) have deaonstrated bending of a 25 kO 
polypeptide isolated froa activated aaerophages to the 
25 Neu. receptor, e plS5*'^ hraology, herein incorporated by 
reference. 

ITT. WDF Taalattmi 

Yarden and Peles (Bieeheaistry 2ft, 3543 (1991) ) 
have identified a 35 kllodalton glyc^rotein trtilch will 
30 stiaulate the ie5*">^ receptor. 
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Zn other publications, Davis st al. (Biochem. 
Biophys. Res. Coamun. ili, X536 (1991), Proc. Natl. Acad. 
Scl. a&i 8582 (1991)^ and Greene et al«, PCT parent 
application PCT/US9 1/023 31 (1990)) describe the 
5 purification of a protein fron conditioned sediuB of a 
huaan T-cell (ATlr^) cell line. 

Huang et al. (1992, J. Biol. Chea. 257:llS08- 
11S12) , hereby incorpwated by reference, have isolated 
an additional neu/erb B2 ligand grovth factor froa bovine 

10 kidney. The 25 kD polypeptide factor vas Isolated by a 
procedure of coluan fractionation, folloved by sequential 
ooluan chroaatography on DEAB/cellulose (DB52), Sulfadex 
(sulfated Sephadex G-50), heparin^Sepharose 4B, and 
Siqmrdex 75 (fast protein liquid chroaatography) • The 

15 factor, NBL-GP, stiaulates tyrosine«*specific 

autophosphorylatlon of the neu/erb B2 gene product. 

ISL — Purification of acfttvlchQlina r^emtor indueina 
flCtiYttY (Mlh) 

ARIA, a 42 kD protein iihich stiaulates 

20 acetylcholine receptor synthesis, has been isolated in 
the laboratory of Gerald Fischbach (Falls et al., (1993) 
Cell 72:801-815) • ARIA induces tyrosine phosphorylation 
of a 185 Kda aoscle transaeabrane protein which reseables 
p]^35erba2^ and stiaulates acetylcholine realtor synthesis 

25 in cultured eabryonic ayotubes. ARIA is aost likely a 
aeaber :of the GGF/erbB2 ligand qroap of proteins, and 
this is potentially useful in the glial cell aitogenesis 
stiaulation and other applications of, e.g., G6F2 
described herein. 
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Protein tvroaina PhQaphoyyl ufcion mediatiad hy GCP 

Rat Schwann cells, folloving treatment vith 
sufficient levels of Glial Growth Factor to induce 
5 proliferation, show stisulation of protein tyrosine 
phosphorylation. Varying anounts of partially purified 
G6F were applied to a prlaary culture of rat Schwann 
cells according to the procedure outlined in Exaaple 9. 
Schwann cells were grown in DMBK/10% fetal calf serua/5 
10 |iK forAolin/O.Sif&g per aL GGF--GK (O.SaL per well) in poly 
D-lysine coated 24 well plates. Khen confluent, the 
cells were fed with DlfEK/10% fetal calf serua at o.SttL 
per well and left in the Incubator overnight to gulesce. 
The following day, the cells %fere fed with 0.2bL of 
15 DMBC/lOt fetal calf serua and left in the incubator for 1 
hour. Test saaples were then added directly to the 
aediua at different concentrations and for different 
lengths of tiae as required. The cells were then lysed 
In boiling lysis buffer (sodlun phosphate, 5nH, pH C.8; 
20 SOS, 2%, ^--Bercapteothanol, 5%; dlthlothreltol, O.IM; 
glycerol, 10%; Broaophenol Blue, 0.4%; sodlun vanadate, 
lOBM) , incubated in a boiling water bath for 10 ainutes 
and then either analysed directly or frosen at -70*c. 
Samples were analysed by running on 7.5% SD8-PAGB gels 
25 and then electroblotting onto nitrocellulose using 

standard procedures as described by Towbin et al. (1979) 
Proc. Natl. Acad. Sol. USA 76:4350-4354. The blotted 
nitrocellulose was probed with antlphosphotyrosine 
antibodies using standard aethods as described in Kemps 
30 and Selton (1988) Oncogene 2:305-315. The probed blots 
were exposed to autoradiography f ila overnight and 
developed using a standard laboratory processor. 
Dens itoaetrlc measurements were carried out using an 
Ultrascan XL enhanced laser densitometer (LKB) . 
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Nolaeular v ight assignaents v re mad relative t 
preatained hi^ volecular weight atandarda (Sigaa) . The 
doae reaponaea of protein phoaphorylation and Schwann 
cell proliferation are very aiailar (Fig. 33). The 
5 Bolecular weight of the phoaphorylated band ia very cloae 
to the molecular weight of pl85^^^^. Similar reaulta 
were obtained when Schwann cella were treated with 
conditioned media prepared from COS cella tranalatea with 
the G6P2BBSS clone. Theae reaulta correlate well with 
10 the expected interaction of the GGFa with and activation 
of 185«*»2. 

Thia experiment haa been repeated with recombinant 
66F2. Conditioned medium derived from a CHO cell line 
atably tranaformed with the 66F2 done (G6F2BB85) 
15 atimulatea protein tyroaine phoaphorylation uaing the 
aaaay deacribed above. Mock tranaf acted CHO cella fail 
to atianlate thia activity. 

EXMgLB 2Q 

M-glveogvlaticn of GCP 

20 The protein aequence predicted from the cSNA 

aequence of GGF-ZZ candidate clonea 66F2BPP1,2 and 3 
containa a number of conaenaua N-glycoaylation motif a. A 
gap in the G6FZI02 peptide aequence coincidea with the 
aaparagine reaidue in one of theae motif a, indicating 

25 that carbohydrate ia probably bound at thia aite. 

M-glycoaylation of the GGFa waa atudied ^ 
obaerving mobility Pangea on 8DS-PAGE after incubation 
with M-glycanaae, an ensyme that cleavea the covalent 
linkagea between carbohydrate and aapargine reaiduea in 

30 proteina. 

M-6lyeanaae treatment of GGF-IZ yielded a major 
band of KH 40-42 kDa and a minor band at 45-48 kOa. 
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Activity singl active d glycosylated spaciea at ca 45- 
50 kDa. 

Activity elation experiments with G6F-Z also 
demonstrate an inoreaae in eleetrophoretio mobility vhen 
5 treated with H-61ycanase, giving an active species of im 
26-28 kDa. Silver staining confirmed that there is a 
mobility shift # although no N-deglycosylated band could 
be assigned because of background staining in the sample 
used. 
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(2) GBRimz. xnroRNiinoiii 



(i> APPLZGABTSt Rdtart Sklar, lUrk Marehionai, 
OAvid Z. Owynnm 

(U) TZTU or ZNVmZCMi: KBTBODS FOR ALTBRZBO 

KD8CL8 OOBDZTZOII 

(Hi) mniBBR or sbq^cbsi 18S 

(Iv) CORRBSPOHDSIfCB MORBSSt 

ih) ADOnsSBBi Fish S Richardson 
CB) 8TRSBTI 22S Franklin StrMt 

(C) CZTYt Boston 

(D) STRTBs Nassaehuastts 
(F) XZPt 02110-2804 

<▼) COWU T BR RBRDABUB FORMs 

CR) HBDZm nPBi Oiskatto* S.25 inch, 360 

kb stora^o 
(B) CQMPRBRt ZBM 
CO) OTBRAirZXG SySTXHi PC-008 
(D) SOFTMRRB: WordporfOCt 

|vi> CDRRBNT APPLZCRTZON DRlRs 

<R) APPLZCRTZOH NUMBBRs 94/05083 A 

(B) rZUBO ORTBi 06-MRI-94 

(C) GLRSSZPZCRTZONt 

(▼ii) PRZOR RPPLXCRTZON DATRt 

(R) RPPLZCRTZOil IfOMBBRs 08/209r204 
(B) rZLZNO DRTBt 08*IIRR-94 

(▼ii) PRIOR RPPLXCRTZON DRtRt 

|R) RPPX.ZCRTZON NOMBBRi 08/059,022 
(B) riLZHO DRTSt 06-ICay-93 

(viii) RTlQltRBir/RGBlIT ZNFORMRTZON: 

<R) SRKBi Clark, Paul T. 

(B) RB6Z8TRRTZ0M HUMBERi 30,162 

(C) RSrBRBBCB/DOCXBT NUNBBR: 04585/0281101 

(iJI) TBLECOMNmiZCATZOH ZNFORMRTZON t 
(R) TBXJIPBONBi (617) 542-5070 
(B) TBZXPRZt (617) 542-8906 
(B) TBZXXt 200154 



(2) ZNFORIIRnON rOR BBOOHICB ZDBNTZFZCRTZON NOMBBR: li 

(i) BBQDBRCI CHRRRCTERZSTZCSi 

(R> ZnOTBt 8 

(B) TTPSt afliino acid 

CO 8TRRHDSDNBS8S 

(D) TOPOMOYt linoar 

(si) SBQOBNCB DBSCRZPTZONt SSQ ZD HOs 1: 

Pha Lya Oly Asp Rla Bis Thr Glu 

1 5 



(2) ZBPORNRTZ R TM 8EQDB1ICB ZDBRTZrZCRTZON NUNBBRi 2: 
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(i) SEQUXHCB CaMACTERZSTZCSt 

<A) LBRGTHt X3 

(B) TYPSt amin acid 

(C) STBMDSDNESSs 

<0) TOPOLOGY s liSMU 

<ij|) FEKSOBEt 

(D) OZBSR XWORmrXGNs Xaa in position 1 is Lysins 

or Arginiaoi Xas in position 
12 is uakaomi. 

(Xi) SaOOBNCB DSSGMPTXCart 8BQ ZD HOi 2s 

Zu Ms Sor XM Ala Asp Glu Tyr Glu Tyr Mat Xaa Lys 

IS 10 



(2) ZHFOmATZOII FOR StOOBICB ZOBITZFZCATZON NUMBERS 3s 
(i) 8BQ0SHC8 CBABACTBRZSTZCBs 

(A) LBBC THt X2 

<B) TTPBi amino aeid 

(C) BTRASDBDaBSS 

(O) TOPOLOGrs liaaar 

(is) nATOBBs 

<D) OTBBR ZHFORMATZONs Xaa in position 1 is Lysino 

or Argininai ^Xaa in . 
position 10 is unknown. 

<Xi) SBQOSNCB DBSCRZPTZOIts 8BQ ZD NO: 3: 

Xaa Thr Glu Thr Sor Bar Bar Gly Van xaa Lau Lys 

1 5 10 

(2) ZRFORMATZOir FOB 8BQDBRCB ZDBRTZFZCATZON NUKBBBs 4: 
(i) BBODBIICE CBARACTBIUSTZCSs 

(A> LTOO THi 9 

(B) TYFBs amino acid 

(C) BTBANDSDllBBSt 

(D> TOPOLOoys linaar 

(isi) FZAZDBBS 

(D) OXHBR ZHFOBMATZOHs Xaa in position 1 is Lysina or 

Argiaina. 

(Xi) 8BQ0BBCB DBBCRZPTZOHi 8BQ ZD NOs 4s 

xaa Lys Lau Gly Glu Mat Trp Ala Glu 

1 S 



(2) INFOBMATZON FOB SBQOBHCB ZDBNTZFZCATZON NUMBSBt 5 3 
(i) S8Q0BNCB CBABACTBRZSTZCSs 

(A) LBWG THi 7 

(B) nPXi amino aeid 

(C) STBABDEDNBSSs 

(D) TOFOLOOYs linaar 
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(lx> PBXTORBi 

CD) OTBBR XNromHTZQNt Xaa in position 1 is Lyvino or 

Xrginino. 

(xi) SBQOBNCB DSSOUPTIONs SBQ ZD NOs St 

Xaa Lou Gly Olu Lyo Arg Al« 

1 5 



12) XH?O»H\TZ0M FOR S^DBUCS ZDSHTZPICIiTZON NDNBBRt 6t 
<i) 88QUBHCB GHMUiCTElUSTXCSs 

(A) LSMTBi 16 

(B) TXns amino acid 

(C) STRMOBDHBSSt 

(D) TOPOLORt linmr 

(iB) TBATUBBs 

CD) OTBBR ZBTORMATIOHs Xaa in position 1 is Lysine or 

Arginino. 

Cxi) S8Q0BHCB DBSCRZPTZONt SBQ ZD NOi 6i 

Xaa Zlo Lys Smr GXu Bis AXs Oly Lm Ssr Zls OXy Asp Thr Ala Lys 

X 5 XO 15 



(2) XBTORMATZOB VOR SB OPB MCg ZDBBTZBZCATZON NDNBBR: 7s 
Ci) SBQ0BBC8 CBARACTBRZSTZCSt 

CA) LBBOTBt X3 

CB) rmz aBiae aeid 
CO STRANDBDBBSSt 

CD) TOVOLOOYt Xiaoar 

Cix) FBATDRBt 

CD) OTBBR ZNPQRMATZOM: Xaa in position X is Lysine or 

Arginina. 

<xi) SBQUBHCB DBSCRXPTXONt SBQ ZD NOs 7s 

xaa AXa sar Lau AXa Asp OXu Tyr OXu Tyr Mat Arg Lya 

X 5 XO 



(2> IRfORNATZCm FOR 8BQDBBCB ZOBHTZFZCATZOR NUNBBRt 6s 
Ci) SBQUBHCB CHARACTBRZSTZCSs 

(A) LBBOTBt X6 

(B) TTFBs amino acid 

(C) STRARDEDRBSSs 

CD) TOFOLOOYs Xinaar 

Cix) FBATDRBs 

(D) OTHER IHFORMATZONs Xaa in position X is Lysine or 

Arginitia. 

Cxi) SBQOBHCB 0B8CRZFTZ Ms SBg ZD ROt 6s 
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Xu Urn hym Gly 61u Hla Pro 61y tmu smr Urn Qly Asp Val Ala Lys 

i 5 10 IS 

(2) XNFOmATXON FOR SSQOSRCS ZDSRTXFZCATION NDMBERi 9t 
(1) SBQUSKCB CHARACTBRISTZCS: 

(A) LgHC THt 13 

(B) RPSi amino add 

(C) STRANDBDHBSSi 

(D) TQPOLOCnrs linMT 

(ix) 



(D) OTHER zmORNATXONi Xm in position 1 is Lysiiio or 

Arginias snd Xmm in position 
12 is unknoim. 

(si) SBQOBHCB DBSCRXPTZ<»ll SSQ ZD KOi 9s 

Xas Mst Ssr Qlu Tyr AlA Phs Phs Vsl Gin Thr Zss Arg 

1 S 10 

(2) XNFORMATZOH FOR SBQOBHCB XDBNTXFZCATXOH NDHBBRs lOt 

(i) SBQDBHCB CBRRACtBRXSTXCSt 

(A) LBHO THt 14 

CB) TTFBt smino aeid 

(C) STRAmBDHBSSt 

(0) lOPOLOoyt linsar 

(iX) FBATURBt 

(D) OVHBR XNFORMATXONt Xss in position 1 is Lysine or 

Arginins. 

(Xi) SBODBRCB DBBGRXPTZONt SBQ XO NOi lOs 

xss Sor Olu His Pro Gly Lou 8or Xlo Gly Asp Thr Als Lys 

is 10 

<2) XRFORNATZC»l FOR SBQOBNCS ZOBRTXFXCATXW NOMBBRs lis 

(i) SBODBHCS CBRRACTBRZSTXCSi 

(A> LBHO THt 10 

(B> TTPBi smino sold 

(C> STRMDBDBBSSt 

<D> TOPOLOOys linosr 

(is) FBATORBs 

(D) OTHER ZHfORMATXOHs Xss in position 1 is Lysins or 

Arginins; Xss in position 8 is 
unknown* 

(xi) 8BQ0BBCB DBSCRZPTXQHs SBQ XD HOt lis 

Zss Als Gly Tyr Pho Als Olu Ess Als Arg 

X 5 10 

(2) XNFORHATXOH FOR 8BQ0BRCB XDBHTXFXCATXON llONBERt 12 1 
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(i) SBgOSRCB GBAIIACTERZSTZCSt 
(h) LSRGTBs 9 

(B) rmt aatao acid 

(C) StmilDEDRSSSs 

(D) TOPOLOOYi linmmr 

{is) FBATimSt 

(P) OTBBR ZNr0IUttTX09$ Xaa in position 1 ia Lysino or 

Arginimi Xaa in position 7 ia 

unknom. 

(xi) SSQUXNCS OSSCRZPTZONt S8Q ZD HQs 12 1 

Xmm Lya Lou olu PKo X.ott tmm Kim Lya 

1 5 



(2) ZHVORKHTZOH PGR SSQOSHCS ZDSSTZPZCHTZON mmSBRt 13 « 
(i) SBQOSNCB CHMACTBIUSTZCSs 

ih) UaSQTBt 11 

(B) TYPBs amino acid 

(C) StCRMIDBDHBSSs 

(D) TOPOLOns liaoar 

(ix) TEKTUmt 

(0) OTBBR ZNPOIMJiTZOBi Xaa in poaition 1 ia Lyaina or 

lurgiaina. 

(xi) SBOOSNCB DBSOaPTZONt SSQ ZD llOi 13 1 

xaa Thr Thr olu Hat Ala 8or Olu Oln 6Iy Ala 

15 10 



(2) ZHPQUMATZON POR SIQOBHCB XDSNTZPZCATZOH ROKBnt 14 1 
(i) S B QUXW C B CBARACTBRZSTZCSt 

(A) LSKGTBt 10 

(B) TTPBi amino acid 

(C) STRAHDBDIlBSSt 

(D) TOPOLOGTt linoar 

<ix) FBATURB: 

<D) OTB8R ZNfOBMATZmii Xaa in position 1 ia tyaino or 

Arginina. 

(xi) SBQUEHCB DBSCRZPTZONt 8BQ ZD KOi 14 1 

Xaa Ala Lya Olu Ala Lou Ala Ala Lau Lya 

I S 10 



(2) ZHPOBMATZON PCm SBQOBBCB ZDXNTZPZCATZON NONBBRs IS: 
(i) SBOOSIICB CHARACTBRZSTZCSt 

(A) LXHOTB: 8 

(B) TYPBt amino aeid 

(C) STRANDSDNSSSs 

(D) TOPOLOGir: linaar 
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(is) PBAtUKBi 

CD) OIBER ZNFQIUnTZ Hs Xaa in position 1 is Lyminm or 

teginino. 

(xi) SXOOBirCB OBSCaUPTZONs $BQ ZD NOs 15s 

Xaa Fho val Lm OXn JU.a Lys Lys 
1 5 

(2) ZRfORIATZON FOR SBQOSirCS ZDSNTZFZC3kTZ0H MDMBBRt 16t 

(i) SBQUSRCB CtnMCTERZSTZCSi 

(A) LSHCTBt 6 

(8) TTPBt amino acid 

(C) STRARDSDIOSSSs 

(D) TOPOLOOyi littMT 



(D) OTHBR ZHFORIIATZONt Xaa in position 1 is Lysiao i 

Arginins. 

(Xi) SMOmeE DBSCRZPTZOMt SBQ ZD NOt 16i 

Xaa tmu oly oiu Mot Trp 

1 S 

(2) znyoracATzoM por swvmncM ZDsmrzcATZoN mncBSRt i7i 

(i> SBOOB N CB CHMACTERZSTZC8I 

(A) LKTO THt 16 

(B) TTPBs amino aeid 
CO STRARDBORSSSt 

CD) TOPOLOGYs linear 

Cxi) SBQDXRC8 DBSGRZPTZORs SBQ ZD NOt 17: 

Gltt Tyr Lys Cys Lou X»ys Pho Lys Trp Pho Lys Lys Ala Thr Val Not 

1 5 10 15 

(2) ZRIDRNATZOn POR 88QDBRCB ZDBRTZFZCATZON NUMBBRt 18t 

(i) 5BQDBRC8 GHARACTBRZSTZC8 1 

CA) LBRO THt 10 

C8) TYPBs amino aeid 

CC) STRARDBDRBSSt 

CD) TOPQLOOYt littoar 

(ix) FBATDRBt 

CD) OTHER ZRFORHATZOHz Xaa in position B is unknoim. 
Cxi) SBQOBRCB DBSCRZPTZOMs SBQ ZD KOi 18t 

Glu Ala Lys Tyr Pho 8or Lys Xaa Asp Ala 

15 10 

C2) ZRPORKATZ R FOR 8EQ0BRCB ZDBRTZFZCATZON NUHBBR: 19: 
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<i} SSQUBRG8 CBMACmZSTZCSt 

|A) ZXmtBi 7 

<8> TXFBt affllao acid 

(C) StRMmSDIISSSt 

(D) TOPGLOGYs XiAMr 

(ix) FSXTtmSt 

(D) OTBBR IRFOIUIIiTZONt XftA in position 2 i8 unknown. 
(Si) SBQOSRCB DSSCRZFTZOMi 8BQ ZD 90t 19i 

Glu Xma Ly« Pho Tyr VaX Pro 

1 5 



(2) zmrojuATZw pm SEonacE zosrtzpzcatzoh mncBBRs 20 1 
CD saonncB cBMncTSRZSTZCst 



(A) LBNGTBs 26 

(B) TYPBi «alno acid 

(C) STRAHDEDRBSSt 

<D) TOPOLOCYs linear 

(Xi) S8QDEHC8 DBSCRZPTZONi 8XQ ZD ROt 20i 

Glu Lm 8or Plio Ala 8or Val Arg Lou Pro Gly Cya Pro Pro Gly val 

15 10 15 

Aap Pro Mot val Sar Plio Pro Val Ala Lou 

20 25 

(2) ZHPOBIIATZOII PM SBQUBRCB ZDBMTZPZCATZOII HOMaBRx 21: 

(1) SKttU K HCa CHMIACTSiaSTZCSt 

(A) UHGTHs 2002 

(8) TTPBs nucloic aoid 

|C) STRAimSDllBSSs ainGlo 

{0} lOPOabOGYt liaaar 

(is) 



(D) can ZHPORHATZONt N in poaitiona 21 and 32 could 

bo oithor A or G. 

(xi) SBOOBNCE DBSaUPTZONt S8Q ZD NOt 21s 

OGAAZTOCTT riTi ' lilXli I ' llllXALil MITiTiX'lXXZ tGCCCTXATA OCTCTTOGCC 60 

TTTCTGtOGT TOCATCGACT TCTTOCOCCT CCTCCTCCCA TAAACAACTC TCCXACCCCT 120 

GCACOOOCAA TAAATAAATA AA A OGAOGAO GGCAAOGGGG CAGGAC GA CO AGXGGT6C1G 180 

CCACCG6AAG GAAAAGGGAO GCAOOGOOAG AAOACCCGGG CAGAGT006A AOCGACAGCC 240 

AGAAGCCOGC A06CACCT0G CACC ATO AOA TGO CGA CGC CCC COG CCC COC 291 

Mat Arg Trp Arg Arg Ala Pro Arg Arg 
1 5 

TCC COG COT CCC CGC CCC COG GCC CA6 CGC CCC CGC tec GCC 6CC CGC 339 
Sar Gly Arg Pro Gly Pro Arg Ala Gin Arg Pro Gly sor Ala Ala ^g 
10 15 20 25 
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TOO TOO COO COO CTG C06 CTO CTG CGh OTA CTO CTO CTO CTO GG6 AOC 367 
Smr Bmr Pro Pro Lou Pro Lou Lou Pro Lou Lou Lou Lou Lou eXy Thr 
30 35 40 

GOG GOC CTG GOO COO GGO COG GOG COC GGC AAC GAG GG6 GOT CCC GOO 435 
Alo Ala LOU AIo Pro Oly Ala Ala Ala Gly Aoa Glu Ala Ala Pro Ala 
45 50 55 

GGG GCC TOG GTG TOO TAC TOO TCC OCO CCC ACC GT6 GGA TOG CTG GAG 463 
Gly Ala 8or Val Cyo Tyr 8or 8or Pro Pro 8or val Gly Sor Val Gin 
60 65 70 

GAG OTA GOT CM OGC GOO GOG GTG GTG ATC GAG GGA AAO CTG CAC COO 531 
Glu Lou Ala OlA Arg Ala Ala Val Val Xlo Glu Gly Lys Val Bio Pro 
75 80 85 

CAO OGG COG CAO CAG GCG CCA CTC OAC AGO AAG GCG GOG COG GOG GOG 579 
Olfi Arg Arg Ola Ola Oly Ala Lou Asp Arg Lyo Ala Ala Ala Ala Ala 
90 95 100 105 

OGC GAG GCA GGG GOG TOG GGC GGC GAT CGC GAG COO OCA GCC GOC GGC 627 
Oly Glu Ala Gly Ala Trp Gly Oly Aop Arg Glu Pro Pro Ala Ala Gly 
110 115 120 

CCA CGG GCG CTG GGG COG CCC GCC GAG GAG COG CTG CTC GCC GCC AAC 675 
Pro Arg Ala Lou Gly Pro Pro Ala Glu Glu Pro Lou Lou Ala Ala Aan 
125 130 135 

GGG AOC GTG CCC TOT TOG COC ACC GCC COG GTG CCC AGO OCC OGC GAG 723 
Oly rhx Val Pro Sor Trp Pro Thr Ala Pro Val Pro sor Ala Gly Glu 
140 145 150 

OCC GGG GAG QAO 000 OCC TAT CIO GTO AAO GIG CAC CAG GTG TOO GOG 771 
Pro Gly Glu Glu Ala Pro Tyr Lou Val Lyo Val Rio Ola Val Trp Ala 

155 160 165 

GTG AAA GOC GGG GGC TTG AAO AAO OAC TOG CTG CTC ACC GTO CGC CTO 819 
Val Lyo Ala Gly Gly Lou Lyo Lyo Asp Sor Lou Lou Thr Val Arg Lou 

170 175 180 185 

GGG ACC TOO GGC CAC CCC GCC TTC COC TCC TOO GGG AGO CTC AAO OAO 867 
Gly Thr Trp Gly Bio Pro Ala Pho Pro sor Cyo Gly Arg Lou Lyo Glu 
190 195 200 

6AC AGO AGO TAC ATC TTC TTC ATO GAG COC OAC GOC AAC AOC ACC AOC 915 
Aop sor Arg Tyr Zlo Pho Pho Mot Glu Pro Aop Ala Aon Sor Thr Sor 
208 210 215 

CGC GOG COG GCC 600 TTC 06A GCC TCT TTC CCC OCT CTG GAG ACQ GOC 963 
Arg Ala Pro Ala Ala Pho Arg Ala Sor Pho Pro Pro Lou Glu Tbr Oly 
220 225 230 

COG AAC CTC AAO AAO GAG GTC AGO OGG GTO CTO TOO AAG OGG TCC GCC 1011 
Arg Asn Lou Lyo Lys Glu Val Sor Arg Val Lou Cyo Lys Arg Cyo Ala 
235 240 245 

TTG OCT CCC CAA TTG AAA OAO ATO AAA AOC CAG GAA TOG OCT GGA GOT 1059 
Lou Pro Pro Gin Lou Lyo Glu Mot Lyo Sor Gin Glu Sor Ala Ala Gly 
250 255 260 265 

TCC AAA CTA CTC CTT OGG TOT GAA ACC ACT TCT GAA TAC TCC TCT CTC 1107 
sor Lyo hmu val Lou Arg Cya Glu Thr Sor Sor Glu Tyr Sor sor Lou 
270 175 180 

ACA TTC AAC TCC TTC AAG AAT 000 AAT GAA TTG AAT COA AAA AAC AAA 1155 
Arg Pho Lyo Trp Pho Lyo Aon Oly Aon Glu Lou Asn Arg Lya Asn Lyo 

185 190 195 
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CCh CAA AAT Kte M6 ATA CAA AAA AAO CCA GGG AAC TCA GAA CTT CQC 1203 
Pro Gin Amn llm Lym llm Cln tym tym Pr Oly X.ys s«r Olu Lm Arg 
200 205 210 

ATT AAC AAA QCA TCA CTO OCT OAT TCT GGA OAO TAT ATO TGC AAA GTG 12S1 
21« Asn hym him amr tma Ala Asp Bmr Gly Glu Tyr Hmt Cym Lys val 
215 220 225 

ATC AOC AAA TTA GGA AAT GAC AOT GCC TCT GCC AAT ATC ACC ATC 6T6 1299 
Urn Smx Lym Lm Gly Asa Asp Smr Ala Bmr Ala Asn Zla Thr Zla Val 
230 235 240 245 

GAA TCA AAC 6CT ACA TCT ACA TCC ACC ACT GGG ACA A6C CAT CTT GTA 1347 
Glu Sar Aan Ala Thr 3«r Thr 8ar Thr Thr Gly Thr 5ar Hia Lau Val 
250 255 260 

AAA TGT GCO OAO AAO GAG AAA ACT TTC TCT GTG AAT GGA GGG GAG TGC 1395 
Lya Cya Ala Glu Lya Glu Lya Thr Pha Cya Val Aan Gly Gly Glu Cya 
265 270 275 

TTC ATC GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG TGC AAG TGC 1443 
Pha Hat Val Lya Aap Laa aar Aan Pro Sar Arg Tyr Lau Cya Lya Cys 
280 285 290 

CCA AAT GAG TTT ACT GOT GAT COC TGC CAA AAC TAC GTA ATO GCC AGC 1491 
Pro Asn Glu Pha Thr Gly Asp Arg Cya Gin Aan Tyr Val Mat Ala Sar 

295 300 305 

TTC TAC ACT AOG TOC ACT CCC TTT CTO TCT CTG OCT GAA 1530 
Pha Tyr Sar Thr Sar Thr Pro Pha Lau Sar Lau Pro Glu 
400 405 410 



TAGGAGCATG 


CTGAGTTG6T 


GCTGCTTTCT 


TGTTGCTGCA 


TCTCCCCTCA 


GATTCCACCT 


1590 


AGA6CTAGAT 


GT6TCTTACC 


AGATCTAATA 


TTQACTGCCT 


CTCCCTGTOG 


CAT6AGAACA 


1650 


TTAACAAAAG 


CAATTOTATT 




TT06C6ACTA 


GTTGGCTCTO 


AGATACTAAT 


1710 


AOGTOTCTCA 


GGCTCCGGAT 


GTTTCTGGAA 


TTOATATTGA 


ATGAT8TGAT 


ACAAATTGAT 


1770 


AOTCAATATC 


AAGCA6TGAA 


ATATCATAAT 


AAA6GCATTT 


CAAAGTCTCA 


CTTTTATT6A 


1830 


TAAAATAAAA 


ATCATTCTAC 


T6AACAGTCC 


ATCTTCTTTA 


TACAATOACC 


ACATCCTGAA 


1890 


AAOOOT6TTG 


CTAAGCTGTA 


AOOBATATGC 


ACTTGAAATG 


ATGGTAAGTT 


AATTTTGATT 


1950 


GA0AAT0T6T 


TATTTOTCAC 


AAATAAACAT 


AATAAAAfiGA 


AAAAAAAAAA 


AAA 


2003 



(2> ZHPORNATZOIf FOR 88QtlIllCB ZDSHTZFZCATZON NUMBSRl 22 1 
(1) SZgamCB CaAHACTERlSTZCS: 

CA). LIHGTBt 12 

(B) TtPSt amino acid 

(C) STBANDEDNSSSt 

<0) TOPOLOGYt linaar 

(ix|. PBATORBt 

(D) OTHSR XHFORMATZONt Xaa in position 11 is unknown. 
(Si) 8BQ0SVCS OBSCRZPTZONi SBQ ZD NOi 22: 

Ala Sar Lau Ala Asp Glu Tyr Glu Tyr Mat Xaa Lys 

15 10 
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(2) XmrORIATZOn for SBQOBHCB ZDBHTXFZGATXON NUHBBRs 23 1 
(i) SEQDBNCB CHMACTERZSTZCSt 

(B) TYPSt amino acid 

(C) aiBKmrnmsst 

(D) TOMLOmt limsr 
(is) nhTOBBt 

(D) OXBBR XnrailATZOIIt Xmm in position 9 is iinknoim. 

(Xl) SBQOSIICB DBSGRZPTXONt SBQ XD NOt 231 

Thr Olu nr 8or 8or 8or Oly Lm xu Ziou Lvs 
1 5 10 ' 

(2) XRrORMATXOH FOR SBQOBHCB ZDBRTXFXntXON NOMBBRi 24s 
. (i) SBQOBHCB CBMACTBRXSTZCSt 

(A) LBHC THt 12 

(B) TTPBs amino acid 

(C) STRAHDBDHBSSi 

(D) tOPOLOOYs linoar 

(Hi) SBQOBHCB DBSCUPTXOHt 8BQ XD HQs 24t 

Ma Sor Loa Ala Asp Olu Tyr Olu Tyr Hat Arg Lys 

15 10 

(2) XHFOBMATXOH FOR SBQOBHCB XDBHTXFXCATXOH ROMBBRt 25t 
(i) SBQOBHCB CBARACTBRXSTXCSt 

(A) LBHO THi 9 

(B) TYPBi amino acid 

CD) TOPOLOoyt linear 

(is) PBATORBt 

(D) OTHBR XHFORMATXORt Baa in position 7 is unknown. 
(Xi) SBQOBHCB OBSCUPtXOHs 8BQ XD HOt 25t 

Ala Oly Tyr Pho Ala Olu Baa Ala Arg 

15 

(2) XHFORMATXOH FOR SBQOBHCB XOBHTXFXCATXQN NOMBBRi 26t 

(i) iSBQOBHCB CBARACTBRXSTXCSt 

(A) LBRGTBt 10 

(B> TTPBi amino acid 

(C) STRAHDBDHBSSt 

ID) TOPOLOOYs linear 

(Xi) SBQOBHCB OBSCRXPTZOHs SBQ XO NOi 26 1 

Tlir Thr Olu Mot Ala. Sor Olu Oln Gly Ala 

1 5 10 
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(2) Zjm>WknOH FOR SBOHBHCS ZOBMTZrXCJlTXOtl mnCBBRt 27 1 

(i) SEQDBRC8 CBMACTBRZSTZCSt 

<A) LBHSTHt 9 

<B) TYFBt Amino add 

<c) STMmmmsst 

(0) TOPOLOert ILammr 

(Xl) SBQUEHCB DBSCRZPTZOlls SSQ ZD HOt 27t 

Ala Z.y8 OIu AXa Leu Ma Ala Lau l*ya 

X 5 



<2) ZraORMATZON FOR SBQOBHCB ZOBNTZrZCATZON NOHBnt 201 
(1) SBBOmCB GBABACTBiaSTZCSt 

(A) LSRGTBt 7 

(B) TTPSt amino aeid 

(C) sTMHommsat 

(0> TOPOLOGY: linaar 
(Xi) SSQOENCB DSSCRXPTZONs SSQ ZD NOs 28s 

Pha Val X.au Gin Ala Lya Lya 

1 S 



(2) ZmrOBMATZCm FC» SBQUSRCS XDSMTZPZCATZOM NUMBBRs 29s 
|i) SBQDBHCS CHARACTSRZSTZeSi 

(A> UNOXBi 21 

(B) TYPBs amino acid 

(C) STBARDBDRBSSt 

(D) TOPOUSGYi linaar 

<xi> SBQUEHCB DBSCRZPTZOMs SBQ ZD NOt 29t 

Glu Thr Gin Pro Aap Pro Gly Gin Zla Lau Lya Lya Val Pro Nat Val 

15 10 IS 

Zla Gly Ala Tyr Thr 
20 



(2) ZOTOmiATZOII FOR BBflOBHCB ZORNTZFZCATZON miMBBRs 30s 
(i) SSQUBNCB CBARACTBRZSTZCSs 
(A> LBNBTHs 21 

(B) TTPBs amine acid 

(C) STRAMDSDNBSSs 

<D) TOPCLOOYs linaar 

|ix) FBATDRBt 

(D) OXHBR ZNFORNATZOlii Xaa in poaitiona 1, 3, 17 and 

19 ia unknoiin. 

(Xi) SSQDBirCB D8SCRZPTZ II s SBQ ZD II s 30s 

Xaa Glu Xaa Lya Glu Gly Arg Gly Lya Gly Lya Gly Lya Lya Lya Glu 

1 . 5 10 15 
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Xmm Xy Xm Oly hym 

20 



(2) ZNfOHMATZOM FOR 58Q0SRCB lOXHTZPICXTZOH NUMBBRs 31s 
(1) SSQOSHCE CBMliCTSIUSTZCBi 

(A) LBKO THt 13 

(B) TtPSt aaiiio acid 
<C) SXRAHDSDSBSSi 

(D) lOPOLOnt liMmr 

ixi) SBQOnCS DXSCRIPTZOiri 8BQ ZD MOs 31t 

jQa oitt Lys Glu Lys Thr Pte Cys Val Asn 61y oly Olu 
I S 10 



(2) ZNPOlOgkTZOH FOR SBODSHCS ZDRRTZFZCATZOH MDMBXRt 32s 
<i) SEQOBRCB CBMACTlRXSTZCSs 
iK) LSROTBt 8 

(B) rm t_ amino acid 

(D) t3oS«"^* lisMT 
FBRTimBs 

(O) omSR zmromBTZOIli laa in position 6 io unknoMB. 

(Ki) SBQOBHCS DBSGRZmOllt 8BQ ZD MOs 32: 

Lys Lm Glu Pte Lau Xan Kla X.ya 
1 5 



(2> ZRPORIATZON FOR SB^^SRCB ZDBRTZFZCRTZM NOMBBRs 33: 

(i) SEQUENCE CBRRaCTERZSTZCSt 

(K) LBBCTR: 9 

IB) TypBt amino acid 

(C) 8TRMIDSDNB88: 

(D) lOMLOOlt liamr 

(1JI> FEKnRBt 

(D) oraSR ZNfORIIRTZOlli Zaa in position 1 ia Lyaino or 

Arginina* 

(Xi) 8B00XIIC8 DBSCRXPTIONi 8XQ ZD NOi 33: 

Xaa Val Hia Gin Val Trp Ala Ala Lya 

1 5 



(2) ZHFORMATZOIf FOR SEQBBRGS ZDSNTZFZCATZON NUMBER: 34: 
<i) SBQOBNCS CBARACCBRZSTZCS: 

(A) IBNOTB: 14 

(B) TtPB: amino acid 
(C> STRARDEDNB88: 

(D) TOPOLOGYs linaar 
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(iX) RATORSt 

(D) OTHER XNVtmmTX Ml Xaa in position 1 it Lyaiao or 

Arginine, Xaa in position 11 
is unknoim. 

(Xi) 8BQUBRCB OSSCRZFTZONt SBQ XO HOl 34t 
Xmm Tyr Xlo Pho Pho Mot Glu Pro Glu Ma. xu Sor 8or Gly 

15 10 



(2) XMPOMATZOM KM SBQDBMCB ZDBMTZFZaTZON MONBSRs 3St 
(i) 8SQDSMCB GBMRCXBRZSTZeSi 

(A) LBMGTHi 14 

(B) TTPBt soino acid 

(C) STRMIDBDMBSSt 

(D> TOMLOGyx linoar 

<ijc> PBATORMt 

(D) OTBBR ZMPORMATZORf Xaa in position 1 is Lysino or 

Arginino, xaa in position 13 
is unknown* 

(Xi) SBQOBMCB OSSaaPTZOHt SBQ ZD ROs 35i 

Xaa tott Gly Ala Trp Gly . Pro Pro Ala Plio Pro Val Xaa Tyr 

IS 10 



(2) ZRPGRMATZOM FOR SBQIIBMCB ZDBNTZPZCATZOM RtlMBSR: 36t 

(i) SBODBMCB CBARACXSRZSTZCSs 

<A) LBMQTBt 9 

<B) TTPBt amino acid 

(C) 8TRAMDBDRBS8I 

CD) TOPOLOGTs linoar 

(iJC) FBATORBt 

(D) OTHER ZMPORNATZOMt Xaa in position 1 is Lysino or 

Arginino* 

(xi) SBQUBNCB DB5CRXPTZON: SBQ ZD NOi 36 1 

Xaa Trp Pho Val Val zio Glu Gly Lys 

1 s 



(2) ZNIORNATZOM FOR BBQOBReB ZD8NTZFZCATZ0M RQMBBRt 37s 
(i) SBQ^CB CRARACTSRZSTZCSt 

(A) XAIGTHt 16 

(B) TTPBi axftino acid 

(C) STRAMDXDNSSSt 

(D> TOPOLOQYi linear 

(ix) FBATDRBs 

(D) OTBBR ZHPORNATZONt Xaa in positi n 1 is Lysino or 

Arginino. 
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(Xi) 8BQUBHCE OXSCRZPTZONi SSQ ZD NOi 37t 

Xwk Ala Ser Fr Val Ser Val GXy Smr Val ein 61u L u Val Gin Aro 

1 5 10 15 

(2) ZmrORMATZm FOR SBQUSNCB ZDENTIFZCATZON NUMBXRs 38 1 

(i) SKQOmCS CHARACTSRZSTZCSt 

(A) LBWCl Hi 13 

(B> TYPBi amiao acid 

(C) STRAMDXDllBSSs 

(D) tOMLOm Xiiwar 

(in) fiAxont 

<D| onn ZmoittlATZOIIs Xaa in position 1 is Lysino or 

Argiaias* 

(xi) SSQUBHCB OBSCRZPTZCais 8BQ ZD HOi 3at 

Xas VaX Cys ZM Thr VsX Ala AXa Lou Pro Fro Thr 

X 5 XO 

(2) ZNFORMATZOir FOR SBSDBRCB ZOBMTZFZCATZON NDMBBRt 39l 
(i) SBOOmCB CHARACTBRZSTZCSt 

(A) LBBO THt 7 

(B) RPBs amino acid 

(C) STRANDSDBBSSt 

(0) nPOLOOYt Xinoar 

(is) 



(D) OTHER ZNFORMATZONt Xaa in position 1 is Lysiao or 

Argininsj Xaa in position 6 is 



(xi) 8BQ0BBG8 OBSCRZPTZONi 8BQ ZD MOt 39t 

Xaa Asp Lou Lau Lou Xaa VaX 

X 5 

(2) ZHFOIMATZON FOR SBQDBRCB ZOBRTZFZCATZOIt HUMBBRt 40i 
(i) 8BQUXNCS CBARACTBRZSTZCSs 

(A) LKHCT Hi 39 

(B) TTFBt amino acid 

(C) STRANDXDllBSSx 

(D) TOFOLOOTi Xinoar 

(xi) SBQDSRCS OBSCRZPTZmi 8BQ ZD MOt 40i 

Cys Thr cys GXy Cys Cys Lya Cys Cys hrg Tlvr Thr Cys AXa Cya Arg 

is XO XS 

Cya AXa GXy AXa AXa GXy GXy Thr Cya Thr Thr Cys Thr Cys Cys Thr 
20 25 30 

Thr cys Thr Cys AXa GXy Cys 
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(2) tnomhtiw won sbqubbcs ZDxmmcATZ n nombbri 4it 

<i) SSQOBIfCB CBMIACXBRZSnCSt 

(A) LSRCTBt 24 

<B) TYPBt «miiio acid 

(C) STRAHDBDHBSSt 

<D) TOPOZMYs liMar 

(Xi) SBODBRCB OBSCRIPTZONi SBQ ZD ROt 41s 

Cys Cys Thr Cyo Oly Cye Thr Cys Cys Thr Thr Cy« Thr Thr Cys Thr 
15 10 IS 

Thr Oly Cys Cys cys Thr Thr Cys 
20 



(2) zmmKhTZOB rmt sboubncs ZDBNTzrznrzoN mncBBRt 42 1 

(1> SBOOBNCB CBMttCTBRZSTICSi 

<A) LBBGTHs 8 

(B) TYPBi aaino acid 

(C) BTmBDBDBBBSi 

<D) TOPQLOOTt liAMr 

Cxi) SSQOBRCS OBSGBXmONs 88Q ZD NOs 42t 
Viil His Ola Val Trp Ala Ala Lya 



(2) ZBFmKATZn fOR SBQDBRCB ZDBBTZPZGATZON NOMBSRs 43l 
(1) SBOOBNCB CBARACTBRZBTZCSi 

(A) LBBC THi 13 

(B) TTFBi amino acid 

(C) 8TRAIIDBDNB88I 

|D) TOPOMoyt linaar 

(ijc) RATOBBt 

(0) QTBBR ZHraUfATZONi Xaa in poaition 10 la unknown, 
(xi) SBVIBMC8 DBSGRZPTZOHt SBQ ZD BOt 43 1 

Tyr Zla Pha Pha Mat Glu Pro 61a Ala Xaa Sar 8ar Cly 

IS 10 



(2) ZNPORMATZON POB SBQOBBCB ZDBBTZPZGATZON NONBBRs 44 1 
(i) SnmnCS CBABACTBBZSTZCSf 

(A) LBNOTBt 13 

(B) TTPBt amino acid 

(C) 8TRABDBD1IB88I 

<D) TOPOXXXSTt linaar 

fix) PBATOBBi 

(D) OTBBB ZNFOBMATZOIIt Xaa in poaition 12 ia unknown, 
(xi) SBQOBIICB DBSCBZPTZONs SBQ ZD BOt 44 1 
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Lm Gly Ala Trp Gly Pro Pro Ala Pho Pro Val Xu Tyr 

15 10 



(2) XHfORKAnOII FOR SBQOBIICB ZD8RRF2CATZ« HUMSmi 45i 
' (1) SBQOBRCB CBAIUkCTBIlZSTZCSJ 

(A> LglC TBt 8 

(B) Ttnt amino «eid 

(C) StBARDBDllBSSs 

(O) TOFOLOGTi liaoor 

(xl> SaOfmCB DBSGRZPTZOIft 88Q ZD NOi 45t 

Trp Pto Vol VAl Zlo Oltt Oly Lym 
1 5 



(2) ZMPOWlATZOn fOSt SSQOBIiCB ZOBNTZPZCATZM HOMBSAs 46 1 

(1) SSQUBHCB CaARACmZSTZCSt 
(A) LBNGTBt 15 
IB) TYPB: amino acid 

(C) STRANDBOBBSSt 

(D) TOPOUXnrt linoar 

(Xi) 8BQ0BBG8 DB8CRZPTZ0M; SBQ ZD NOt 46s 

Ala Sar Pro Val Bar Val Gly sar Val Gin Glu Lau Val Gin Arg 

15 10 15 



(2) ZMyORMATZON FOR SBQOBIiCB ZOBMTZFZCATZON NOMBBRi 47 1 
(i) SBQOBRCB CBARACTSRZBTZCSt 

<A) LBBGSBi 12 

(B) TYPB I amino acid 

(C) 8TRARDBDRB88I 

<D) TOPOlXXnrt linaar 

(Xi) SBQOBRCB OBSCRZPTZONs SBQ ZD NOt 47t 

Val Cya Lau Lau Thr Val Ala Ala Lau Pro Pro Tbr 

1 5 10 



(2) ZNFOBKATZOR FOR 88Q08NGE ZD8BTZFZCATZ0R ROMBBRt 48s 
(i) SBQOBRCB CBARACTERZSTZCSt 

(A> ZXHOXBt 9 

(B) TnBt amino acid 

(C) STRANDBDNESSt 

(D) TOPOtOOyt linaar 

<xi) SBQOBRCB 0B8CRZPTZ0N: SIQ ZD ROt 48t 

Lya Val Bis Gin Val Trp Ala Ala Lya 

1 5 



(2) ZRFORMATZOR FOR SBQOBRCB ZDBRTZFZCATZOR ROMBBRi 49t 
(i) SBQOBRCB CHARACTSRZSTZCSt 
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th} LSRorat 13 

(B) TYFBt aalBO Mid 

(C) STRAMDSDNBSSf 

<D> TpPGLOGYt liMar 

(ix) FBATORBi 

(D) OTHER XNTORMATZORs XM in position 12 is unknoifn* 
(xi) 8XQUBNC8 DESCRZPTIORs SSQ ID HOs 49 s 

Lys JUft Sor Lm Ala A«p Sor Oly Clu Tyr Mot Xaa Lys 

1 S 10 

(2) ZBPOiQIMnON FOR SW8NCB XDBKTZTZATZOII NOMBSRs 50t 
(i) a aO OMCB CHMRCTBRZSTZCSt 

(A) ZMOTHs 6 

(B) nvBt aaloo acid 

(C| STRMIOBDllBSSt 

<D| TOMLOGYs linoar 

lis) PBAtORBi 

<0) omBR ZMTORmnOllt XU in position 5 is 

unknoim* 

(Xi) 8BQUBHCB DBSCRZPTZOH t SBQ ZD IfOt SOt 

Asp Lm Lou IM Zm val 

1 S 



(2) ZmORIIATZON FOR SSODBIICB ZDBRTZrZGATZON RDMBBR: 51 1 

(i) 8BQDBMCB CHMACXBRZSTZCSi 

(A) LBHOTHt 20 
IB) TTPBs nuclsic acid 

(C) STRARDSDNBSSt singlO 
(0) TOPOLOOYt linsar 

(xi) SBQnSHCB DBSCRZPTZONs SBQ ZD NOt Sit 

TTTAARGGNO AYGCNCAyAC 



(2) ZNfORMATZON FOR SBQUBHCB ZDBmZFZGAXZON ROMBBRt 52 1 
(i) SBQtnOiCB CBMIACTBRZSTZCSs 

(A) LBHOTBt 21 

(B) TYFBt nuclsic acid 

(C) STRRNDBPMBSSs siagls 
ID) TOFOMOYc linoar 

(xi) 8BQDBRCB DBSCRZFTZONi SBQ ZD NOi S2i 

CATRTAYTCR TAYTCRTCNO C 



(2) ZBFORMATZ H FOR SBOOBNCB ZDBHTZFZCATZOB NUKBXRs 53 1 
(i) SBQOBRCB CBARACTBRZSTZCSi 
(A) LBNOTB: 20 
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(B) TTPEt nucittie aeid 

(C) STRARDEONBSSs minqlm 

(D) TOPOLOCYt llAttAT 

(Mi) SBQUSRCB DESCRZPTZOMs 8BQ ZD NOt 53s 
TOYTCNQANG CCATTTGN6T 



(2) ZHfOmMZOH FOR SBQ^JBRCS ZDnTZPZCATZON NUMBBRi 54 1 
(i) BBQDBIfCB CHMACXBRZBTXeSi 

(A) LBHg rat 20 

(B) imt auelsie acid 

(C) snMDBDBBBSt siagXs 
<0) tOFOLOOTt Ximar 

(Xl) SBQDBirCS DBSGRZPTZOMt 8BQ ZD BOs 54s 

TCTTCRCTBO OCXnTCNOT 



(2) ZHTORMATZOB FOR SBQUBHCB ZOBRTZFZOiTZON NUMBBRs 55 1 
(i) SBQDBNCB CHJUUCTBRZSTZCSs 

(A) LBmiBs 20 

(B) TCnt nuel«ic aeid 

(C) STRAHDBDMBSSs Siligla 
CO) TOFOLOOyt liiwar 

(xi) 8BQ0BRCB DBSCRZFTZOlit 8BQ ZD BOs 55s 

OCDRTNACCA TBGCHACTTT 



<2) ZRFmMATZON FOR 8BQDBHCB ZDBBTZFZCATZOM NUMBBRs 56s 
(1) 8BQ0BBC8 CHIimCTBRZSTZCS t 

(A) X.BB6TBt 20 

(B) TYFBs nuclaic aeid 

(C) STRANDBDBBSSs aingla 

(D) TOFOLOOTs linear 

(Xi) SBQDBNCB OBSCRZPTZONs SBQ ID BOi 56 s 
GCNGCCCANA CTTCRTOBAC 



(2) ZRFQRMATZON FOR SBQOBIICB ZDBNTZFZCATZON NONBERs 57s 

<i) 88Q0BRCB CBARACTBRZSTZCSs 

(A) LBNOTBs 20 
<B) nPBs aueleie aeid 

iC) STRABDBDNBSSs Single 
(D) TOFGLOOYs linear 

Cxi) BBQOEBCB DBSCRZFTZOBs SBQ ZD BOs 57s 

GcrrcsooYT CCATRAARAA 



C2) INFORMATZON FOR SBQOBNCB ZOBRTZFZCATZON NmCBBRs 5Bs 



SUBSTITUTE SHEET (RULE 26) 



wo unan FCT/us94/osaa3 

- 104 - 

<i) swaama caiaucmzsTzcst 

(A) LBHGTBs 20 

(B) mzt nueXsie acid 

(C) STRARDBDllBSSi ailigl* 
(O) TOPOLQCyt XlMar 

(xi) SBQOSNCB DSSCRZPTZOH: SBQ ZD NOs 58 1 

CCnCDATBUi CSOkCSMCCh 20 

<2) ZnraUATZOII for nOOBRCB ZOSRTZFZCJiTZON NOMBBRs 59 1 
(i) SBQDBMS GBMMXBRZmCSi 



(B| TSni nuelBic acid 

(C> SCTMDBDHBSSt minqlm 
CD) TOVOLoeri iinur 

(Xi> Sg OUlMCE DBSCRZPTZONs SBQ ZD HOs 59t 
TCHGCRMRT MOCRCC 17 

(2) ZmCBNATZON VOR SEQOBHCB ZOBNTZPZGATZON BUMBSRt 60 1 

(1) SBQUBRCB CBIOIACnnaSTZCSt 

(A) LKHQ THi 20 

(B> TYFBi oucXbIc acid 

(C) srmRDBDHBBSi slngla ^ 

(D) TOBOLOCTs lillMr 

(Xi) SBQOBNCB DB8CRZPTZQNS SBQ ZD MOs Ms 

GGNGcmoMc cncmiiGc 20 

(2> ZNrORMATZOM FOR 88Q0BRCB ZDBNTZFZCJITZON NOMBBRt 6Xt 
(1) SBQOEIieB'CHMUCTBRZSTZCSt 

(A) LKBO tHt 20 

(B) RFBi nueX^ic acid 

(C) STRHnDBDRBSSt SiJigXa 
<D) TOBOLOGyt XiMar 

<Xl) SBQOBIICB DBSCRZPTZONi SBQ ZD VOt 61i 

CCNOCYRMIO CITCZTTNUC 20 

(2) ZNFORKATZOB FOR S8QUBRCE ZDBNTZFZCMZOH NOMBBRs 62s 
<i) SBQUBHC3B CHRRRCTBRZSTZC8 1 

(A) XARGtBs 20 

(B) TTPBs nucXaic acid 

(C) STRANDBDBBSSi aingXa 
(D> TOFQLOOYs Xlnaair 

(xi) SBOOBIfCS DBSCRZPTZONs SBQ ZD NOs 62s 

TT Y TT WWIT CmONACRAA 20 
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(2) ZNTORMXTXCm FOR SEQUBNCE XOENTZFZCATZOR RDNBEIIt 63 1 

(i) SSQtTBNCB CBAmCTBRZSTZCSs 

(A) LSHGTBt 20 

(8> TtPZn nuelttic acid 

(C) SlRAHOSDNBSSt slngl* 

(D) tOPOILOms liMW 

{Mi} SBQUnCl OBSCilZPTZCnrt SBQ ZD not 63s 
ll'iTTNGCtT GXMXMSM 20 

(2) ZHfOMATZOn TOR SIQamCB ZOSNTZPZCATZON NUMBSRt 64 1 

(i) smmw c i cHMUcxnzsTzesi 

(A) IXHO THt 17 

IB) RPBt nucl#ie'aeid 

(C) SntAIIDBDRBSSs singla 

(D) TOPOZMYi liMar 

(xi) S8Q0BNCB DBSCRZPTZONs SEQ ZD NOt 64 1 
TSHACNAOn CnOSAC 17 

(2) znromATZOR fw sbodbhcb zobntzpzcatzoii mniBBRs 6St 

(i) 88QIIBBCB GHAMCXSUSTZCSs 



(A) LlWOTHi X7 

(B) TTPBi aucXaie acid 

(C) gCTAM PBDMBSSt Bingla 
(D> tOPOLOOTi XiBMr 

(xi) BBaOBHCB DBSCRZFTZOIIs SBQ ZD ROt 65 1 

TGBAOrAAirT CnOHAC 17 

(2) ZHfORKATZGN FOR SBQOBBCB XDBHTZFZCATZON NUMBBRi 66s 

(1) SBQUmiCB CBARACTBRZSTZCSs 

<A) LBKO THs 21 

<Bi TTPBs luiciaic add 

<C) 8TRARDBDBB8SS aingla 

(D) tovOLOOri linaar 

(xi) SBOOBHCB DB8GRZPTI0NS 8BQ ZD MOs 66s 
GATBTAnCN OCRQARTCNO C 31 

(2) ZNFORKATZON FOR BBQOBIICB ZOBNTZFZGATZON RUMBERs 67s 
(i) 8BQIIBRG8 CSABACTBRZSTZCSs 

(A) XJBHOTBs 21 

(B) TTFBs nucXaie acid 

(C) STRANDBDMBSSt aiagXa 

(D) TOPOLOGnrs Xinaar 

(xi) SBQOBNCS DBSOUFTZONs SEQ ZD N t 67s 
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(2) XRVOBIATXOH TOR SBQXmHCB XDBNTXrZCHTZOM NOMBBRi 6Bt 
CD SBQUBNCE GHMUiCTBRXSTXCSt 

(A) LSBCTBt 21 

(B) rmt nuclsie acid 
<C| STIUNDSDHSSSt flliigl* 

(D) TOMZdOnt liiwar 

(si) SBQUBRCS DBSOaPTZOVt 88Q XD NOt 681 



(2) xnrcmnrzoR wr sbodbkb xdbntzpxcatxoh hombbri 69t 

(1) SBQ0BBC8 CBMBCTERXSTZeSt 

(R) ZJtHGtHt 21 

<B) Ttnt nuclBle acl4 

(C) STRMlDBOBBSSs •inglB 

(0) TOPOZMYt liMBT 

(xi) SBQOBNCE DBSCRXPTZOHt SBQ XD ROt 69 s 

RGMOCMOCN RCNOMIOCn T 

(2) XHfORICRTZaN VOR SBQIIBRCB XDBHTXrXCATXOM NUMBBRs 70s 
(1) SBOOBRCB GBMRCTBRXSTXeSi 

(A) LBHO THs 21 

(B) TtPBs BttclBle acid 
<C) snUHDBDMBSSs Ringl* 

(01 TOWEdOOys llnrar 

(si) SBQOBRCB OBBGRXFTZORs SBQ XO ROs 70s 
RCTRTCMGCSf ARRORIIOCyT T 



(2) XRFORMITXOH FOR 8BQUBRGE XDENTXrX«TXOM MOMBBRt 71t 

CD SBQOBRCB CBM»CTBRZ8TXC8s 

CA> LBHO THl 21 

(B> nVSt mtclBie ftcid 

CC> 8TRMiDB0RB88s BlnglB 

CD) TOPOtOOyt llMAT 

(xD SBQOBKCB DBSCRZPTZORS SBQ XD ROt 71s 

RCTRTCR6CB AGROAROCn t 



(2) XNVORMXTZOR TOR 8BQ0BRCB XDBRTXFXCATXOR NUMBBRs 72s 
ID SBOOBRCB CBRRRCTBRZ8TZC8S 

(A) IXBOmt 21 

CB) TTPBs naelsie acid 

CC) STRMlDBDRBSSi singla 

CD) TOPOLOGY: liBBar 
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{Xi) S8QUSNC8 OSSCRXPTXOHt 8BQ ZD ROt 72t 
KGJUIZOI6CT AARCTRGCTT 7 

(2) ZNTOMATZOlt FOR SBQOBNCB ZOBNTZFZCATZON miHBBRs 731 

(1) SMffma CHMACTBiUSTZCSs 

{K) UBBQTBi 21 

(B> TTPSt auclaic acid 

<C) SnUkHDSDNBSSt siiigl^ 

(O) tOPOLOOTt liBMT 

(si) SBQOBIICS DBSCRZPncaii SBQ ZD not 73r 

HGMITCBOai HgRCT HO CT T t 

f2) XNFORIATZOII rOR SSQOSHCE ZDB8TZPZCATZ0N MmSBIIt 74s 
(i) 8BQDSRCB CHMACTSRZSTZCSi 

|A) tKHC THi 21 

(B) nPBi auclsic acid 

(C) STRAH DBDBBSSt single 
(P) TOPOZMtt lioMr 

(si) SBOUBRCB OBBCRZPTZQNl 8BQ ZD ROt 74s 

BCXSXGBGCT MRCmOCYT T 



(2) IMfOkMRTZCV fOR SBQOBNCS ZDBBTZPZCJiTZOil NOKBBBs 75s 
(i) SBQOBVCB GHBRBCXBRZSTZeSt 



(A) ZXKGTBt 21 

(B) Tmt nuelaie acid 

(C) SmailDBDlfSSSi singla 
(D> TOMLOOys linear 

(xi) SBQUBNOS DBSCRZPTZONt S8Q ZD NOt 75s 

BCTRCTNGCN AORCTNOCn T 

(2) ZBFORmRW im SBOOmCB ZDBBTZPZCATZON KDMBERs 76s 

(i) BBOOBBCS GHBlUICTBilZSTZCSt 

(A) ZAIGTBs 20 

IB) TRBs sluelaie acid 

(C> STRABDBDHBBSs Singia 

ID) TOPOLOors liaaar 

(Xi) 8BQDBNCB DBBOZPTZONs 8BQ ZD HOs 76s 
ACHACMGARA 7OGCTCNN0A 

(2) ZNfORMATZON FOR SBQUBNC8 ZDSNTZPZCATZON NOMBSRs 77: 

(i) SBQUSNa CBARACTBRZSTZCSs 

CA) Z«GTBs 20 

IB) TTPBs auelaie acid 
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(C> STRMfDSDMESSf singlo 
(D» tOFOLOOirs XlMar 

<xl> SBQimCB OSSCRZRZONi 88Q ZD ROt 77i 

XCKXCRGARA TOGCHGYIIOA 20 

(2) IWtOBmnOH ffOR S80mC8 ZDENTZPZCMZOH NONBSRs 78 1 

(1) SSOOBRCB CBMACmZSTZCSi 

ih) LKRO THt 20 

<B) TtPBt nuclttip aeld 

<C) STRAHDSDBSSSx slttgltt 

(D) TOPOLCOYi ILnmmr 

(Mi) SMOnnCS PSSCRZPTZOII: S8Q ZD HOi 78i 

CMGauRonr gggcirscrm 20 

(2> ZRTOMIXTZCm VOR SBODSNCB ZDBNTZPZnTZOH HQMBBRs 79s 
(i) SBQOSNCB CBMACTBRZSTZGSs 

(A) tggO tBt 20 

(B) mst tmeXeic acid 

(C) StBMDBOSBSSt slagl* 
<0) tO9OL0OYs llnsar 



(Xll SBQIIBKB DBSCRZPTZOHs 8BQ ZD MOt 79t 
TTYGTMOTNA THOMGimAA 20 

(2) IHraUATZOII MR SB0 WPI C 8 ZDBNTZPZCAtZDN NOMBBRi BOi 
(i) SBffmm CBARACmiZSTZCSt 

(A) LBWO rai 20 

(B) TTFBs nucleic acid 

(C) STRAIfDBOBBSSt Single 

(D) TOPOLOOTi linear 

<xi) SBQ0B8CB DBSCRZPTZONt SBQ ZD NOs 80 1 
AARdONQAarG dtCAYACMGA 20 

(2) ZHrOBMATZON FOR SBQDERCB ZDBNTZPZCATZGR NIMBERs 81s' 

<i) SBQUSRCS CHARACTBRZSTZCSS 

(A) LgBG TBs 20 

<B) TTPBs nucleic acid 

(C| STRANDBDlfBSSt aingle 

(D) TOfCSUOatt linear 

(xi) SBQDBHCE DBSCRZPTZONs SBQ ZD ROs 81i 

0AR6C H YTO0 CNGCRraiAA 20 

(2) ZR90RMATZ0N VOR SBQUBRCB ZDBRTZPZCATZOII NOMBBRs 828 
(i) SBQOBHCB CBARACT8RZ8TZCSS 



SUBSnrUTE sheet (rule 26) 



W0 94/2tf298 

- 109 - 

(A) LERCTBl 20 

(B> TtPBi nucleic acid 

(C) STmUDEOflBSSi single 

(0} TOPOLOOyt linMT 

SBgUSnCB DSSCUFTZOMx SBQ ZD NOs 82s 

G T BOON TC HO THcaimiuurT 



(2) ZNFOilMltTZON FOR SSQOBNGB ZDBIITZrZCATZOlt MUMBBR: 83s 

(1) ssooncB CBMAcnaasTzcss 

(A) LggC THs 20 

(B) TCT gt naclele acid 

(C) STRARDSDHBSSs single 

(D) TOPOLOGYs lineer 

(Jli) SBQOERCE DRSCRZPTZON: SBQ ZD NOs 83: 
GTROONAGW tRCARCARyT 20 



(2) ZRtmMAnON FOR 8BQ0BMCE ZOBMTZFZCATZON NOMBBR: 84s 
(i) SBOOBRCB CBARACTBRZSTZCSs 

(A) LBHG THs 21 

(B) nFBs nucleic uid 

(C) StRAHDBDNBSSs siitgle 
(01 TOPOUms linear 

(Ri) SBOOBNCB DBSaZFTZOHs SBQ ZD NO: 84: 

RACXI'ITTIN ARDAXnONG C 21 



(2> ZNFORMATZON FOR SBQOBNCE ZDBNTZFZCATZON NUNBBRs 8Ss 
(i) SBC^IBNCB CBARACTBRZSTZCSs 

(A) LBBGTBs 417 

(B) TTPBs nucleic acid 
<C) STRAHDBDNBSSS single 

(0) TOPOLOOTs linear 

(ix) FBATORBs 

(D) 0T8BR ZNFQRKATZONs Zaa in positions 14, 23, 

90, 100, 126, and 135 is a 
stop codon. 

|Ki) SBQOBNCE DBSCRZPTZONs SBQ ZD NOt 8Ss 

TCXAA AAC TAG AOA GAC T6T ATT TTC ATG ATC ATC ATA 6TT CT6 TGA AAT ATA 53 
Aan Tyr Arg Asp Cys Zle Phe Met Zle Zle lie Val Leu xaa Asn Zle 

1 5. 10 15 

CTT AAA COO CTT TOG TCC TGA TCT TOT AGO AAG TCA 6AA CTT CGC ATT 101 
Leu. Lye Pro Leu Trp Ser Xaa Ser Cye Arg Lys Ser Glu Leu Arg Zle 
20 25 30 

AGC AAA GOG TCA CTG GOT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC 149 
Ser Lye Ala Ser Z-eu Ala Asp Ser Gly Glu Ser Met Cye Lys Val Zle 
35 40 45 
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hoc MA CTA GGX AAT CAC AOT GCC TCT GCC AAC A7C ACC ATT CTG CAC 197 
S«r Lya Lmu Oly Asn Aap smr AIa s«r Ala Asa Zla Arg Urn Val Glu 
50 55 60 



TCA AAC GOT AAO AGA TGC CTA CTG OGT GCT ATT TCT CAC TCT CTA AGA 24S 

■ " Il« 

75 



smz Am Gly Lya Axq Cys Lm Lgu Arg Ala Zla Sar Gin Sar Lau Arg 

70 75 80 



QGA GTO ATC AAO GTA TCT GGT CAC ACT TQA ATC ACQ CAO CTG TOT GAA 293 
Gly Val Zla Lya Val Cya Gly His Thr Xaa Zla Thr Gin Val Cya Glu 
85 90 95 

ATC TCA TT6 TGA ACA AAT AAA AAT CAT GAA AGO AAA ACT CTA TGT TTG 341 
Zla Sar Cya Xaa Thr Aan Lya Aan Hla Glu Arg Lya Thr Lau Cya Lau 

100 105 110 

AAA TAT CTT ATG GGT CCT CCT GTA AAC CTC TTC ACT CCA TAA GGT GAA 389 
Lya Tyr Lau Mat Gly Pro Fro Val Lya Lau Pha Thr Fro Xaa Gly Glu 

115 120 125 

ATA GAC CTG AAA TAT ATA TAG ATT ATT T 417 
Zla Aap Lau Lya Tyr Zla Xaa Zla Zla 

130 135 

(3) ZMFORHATZOR FOR SBQOBNCS ZDBNTZPZCATZON ilUNBSR: 86s 
(1) SBQOBKCB CBMACmZSTZCSs 

(A) LPW THt 33 

(B) TTPBt nuclaic acid 

(C) STMMDBDIlXSSt aiagla 
(0) TOPOLOGY s liiiaar 

<ijc> PBATOnt s 

|D> OXHBR ZNPORMATZmi N at poaitiona 19, 25, and 

31 ia Znoaina. X can ba 
eytldlaa or thynidiaa. 

8IQ0BHCS D8SCRZPTZ0NI SEQ ZD NOt 86i 

COCAATTCT G CAGGARAOfC ARCCHGATCC NOG 33 

<3) ZRPORMATZCm POR SBQOBHCS ZOBMTZFZCATZON NUKBXRs 87s 

(i) S8Q0SIICB caMACTSRZSTZCSi 

<A) LXHO THt 37 

(B) TTFXi nuclaic acid 

<C) STRANDEOHSSSi aingla 

(D) TOFOLOGYi liaaar 

(is) 



(D) OXBBR ZBPQRKATZOIIt N at poaitiona 14, 20, 23, 

29 f and 35 ia Znoaina. , 

(ai) SBOUBKCS OBSCRZFTZONt SBQ ZD NOt 87 i 

AAGGATCCTG CAGNGTRTAN GCROCDATNA CCATNGG 37 

<2) ZHPORKAnoif FOR BBQOBRCB ZDBMTZFZCATZQN NOMBBRt 88t 
Ci) SBQOBIICE CHARACTSRZSTZCSs 
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<X) LERGTBt 34 

(B) TTPBs nucleic acid 

(C) STRMIOBONBSSi single 
(D| TOPOLOGY t UMar 

(iJC) FEMXUmt 

(O) OZBBR ZNPOlUATZOHt II «« positions 16, 21, and 

24 is Znosins. Y can bs 
eytidins or thymidine. 

(Ki) SSQOBIICE OBSCRIPTION: SBQ ZO NOt 88 1 

OOQMnCTG CM0C8GXYT CNGGHGMTA YMO 34 



(2) znFORmxzoii for sbooshci ZDENTzrzcjiTZQN mmaBH: 69 1 

(i) SBODBIICB CHMUCISIUSTZCSs 

(A) LgHCTOi 33 

(B) TYPBt nucleic acid 

(C) SnUOIDBDNESS: 8in9le 

(D) TOPOLOGY: linear 

(ix> FBXTORBs 

(D) OTHBR ZMFOBMATZONt N at positions 16 and 25 is 

Zttosine. Y can be cytidine 
or thymidine. 

(Xi) SBQUBliCB 088CRZPTZOIIS SBQ ZD HOt 89s 

COGXATTCTG CMGCMOAYX GYOGBOJOOIk Xht 33 



(2) XNFOlUUiTZON FOR SBQUBNCS IDBNTZPZCATZOH NUMBERS 90t 

U) SBODBVCB CHARACTBRZSTZCSt 

<R) ZJBROTBt 34 

(B) TYPBs nucleic acid 

(C> STRAimBDRBSSt single 

<D) TOPOLOGYt linear 

(ill) rsnTcmst 

(D) 0TB8R ZBPORMlTZONt N at positions 14, 15, 16, 

26, and 29 is Znosine. Y 
can be cytidine or 
thymidine. 

(Xl> SBWENCB DBSCRZFTZOBt 8EQ ZD HOi 90t 
MifiGXTCCTG GMRItHCKTR TAYTCNCCNO MOC 



(2) ZNFORIUTZOR POR SBQOENCB ZDBNTZPZCATZON NUHBERf 91s 

(i>. S8Q0BNCB CRMACTSRZSTZCSi 

(A) LSRGTHs 34 

(B> TYPB: nucleic acid 

(C) STRMIDEDHBSSs single 

(O) TOPOLOGY: linear 

(ix> FSRTtTRB: 
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{D) OTHER XHFORMIkTXONi H at positions 14, IS, 16* 

and 36 is InosiAO. Y can 
bo eytidino r thymidino. 

ini} SXQUBNCB DSSCRZPTZONs 58Q ID NOs 91s s 
AMGATOCTG CMimNCIlTR TMrTOfCCRC TBTC 34 
(2) ZXIfOBMATZOH FOR SSQIIBNCE ZDBNTZFZCHTZON NUMBERS 93: 

(1) SBQUBMCE CHMACTERZSTZCat 

{K} LBROTBl 33 

|B1 nVBs nucloie acid 

CC) STRJOIDBDnBSSs singlo 

(0) tOPOZMYs Linoar 

(ijc| fBKtORBt 

(D) OXBBR ZN90RMJITZ0N: N at positions 21, 28, and 

31 is Inosino. Y can bo 
cytidino or thymidino. 

(xl) SEODBRCE OBSCRZPTZONs 5SQ ID NOi 92: 

OOGARTTCTG Ca^CCKTCMG TRTGG6CNGC NM 33 

(2) ZNfORMRTZOR FOR SBQinDICE ZDEMTZFZCATZOII NUMBERS 93s 

(1) SB Q U E BC B CBMRCTBRZSTZCSs 

(A) LBBGIHs 35 

CB) TtVSs nucloie acid 

(C> STRAMDBDRBSSs single 

(D) TOFOLOCYs linoar 

<ix) FEATURES 

(0) OTHER ZMFORMATZONs N at position 31 is 

Inosino* y can bo cytidino 
or thymidino. 

<xi) SEQUENCE DBSCRZFTXONs S8Q ID NOs 93s 

COOAATTCTG CAOATBTm TYATOGARCC N8ARG 35 

(2) INFORMATION FOR SEQUENCE IDENTZFZCATION NUMBERS 94s 
(i) S EQU ENC E CRARACTERZSTZCSS 

(A) LENOTHs 35 

(B) TTFEs nucloie acid 
<C> STRANDEDNESSs singlo 

(D| TOFOLOOYs linoar 

(ijB) FEATURES 

{D} OTHER ZNFORNATZONs N at positions IB, 21, 24, 

27, and 33 is Znoslno. T 
can bo cytidino or 
thyaidino. 

(JCi) SEQUENCE DESCRIPTIONS SEQ ZD NOs 94s 
COOAATTCTG CAGGGGGNCC NCCNCCNTTT CCN6T 35 
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(2) INFOIUUTXOIf FOR SBQnNCE ZOERTtrXCATZON NIINBERs 95 1 
(i) 88Q0BNCB CBMACnRZSTXCSs 

ih) UnOXBt 33 

(B) TXni naeXttic acid 

(C) STRMIDSDMBSSt slngltt 
(0| TOPOLOOys liaMT 

(ix) FBATORBt 

(D) OTHER ZHraOATZONt N at poaitioM 21 and 24 is 

Znoaiae. Y can be eytidlaa 
or thymidina. 

(si) SWBNCB DSSCRZPTZCms SBQ ZD NOi 9St 

COGMRCTG GASTGGmC TNOTNATBGA RBO 33 



(2) ZNPORKATZOll POR SBQUENCS ZDSMTZFZCATZON mmSER: 96t 

(i) SBQUENCS CHMRCTERZSTZCSs 

(A) LBBW Ht 34 

(B| ntSs auclale acid 

(C) STRRHOBDNBSSs aingia 

(D) ropauoGtt liaaar 

(ix) RRTURBl 

(D) OTBBR ZHPORMRTZONt N at poaitiona 17, 20, and 

26 ia Zaoaina. Y can hm 
cytidina or thymidiaa. 

(si) SBQUBHCE OBSCRZPTZON: SBQ ZD NOs 96 

ARCGATCCTO CRCyTTIfCGai OOOCANACITT CRZG 34 



(2) ZNFORMATZON FOR SBQDBNCB ZOENTZFZCATZON NDMBBRt 97 1 
(i> SBQUBUCB CHARACTBRZSTZCSt 

(A> liBBC THt 33 

(B) nPBt nuclaie acid 

(C) STRABDBDHBSSi singl* 
fO) TOFOLOOYt liaaar 

(ix) FBATORBi 

(D) OTBBR ZRFORMATZONt M at poaition 19 ia 

Zaoaiaa. Y eaa ba eytidiaa 
or thynidiaa. 

(xi) SBQDBNCB OBSCRZPTZONi SBQ ZD NOt 97t 

AA66AT0CTG CABGCTTCNG OnCCATRAA RAA 33 



(2) ZNFORNAXZGH FOR sisQDBNCB ZDBBTZFZCATZON NUNBBRs 98 1 

(i) SBQDBNCB CHARACTBRZSTZCSt 

(A> LERGTRi 33 

<B) TYFBs nuelaic acid 
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(C) STRANDSDNZSSi alngl* 

(D) TOPOLOGY s liattar 

(ix) FBHTORBt 

(D) OTBBR XHFOWgiTZONt N at pQaitions 16, 22« 25, 

28, and 31 ia Znoalna. 

(xi) SBQUBRCB OESCUFTZONi SBQ XD NO: 98: 

AMGMOCTG CUGMIIGGiUi MGCMGIIGG NCC 33 



(2) ZHVORMKTXON FOR SMQUmCM ZDSNTZriCATZON NgMBBRi 99 1 
CD SBQimCB CHMIiCmXSTICS: 

(A) LERGTHs 3S 

(B) TTPBi nuclaic acid . 
(C> STRAKDBOBBSSt aiiigla 

(D) TOPOLOGY: linaar 

<ix) FBMORB: 

(D) OTBBR XNFORKIiTXONs N at poaltlons 17, 26« and 

29 ia Xnoalna. T can ba 
eytldiaa or thynidiM. 

(si) SBOOBNCB DBSCRXnXORt SSQ XO NO: 99: 

MOGMCCTO CMTTTRCCY TCDRTNACRA CRAAC 35 



(2) XRTORMATXON VOR SSODBNCB XDBNTXFXCATXM NUKBBR: 100: 

(1) 8BQ08HC8 CBARACIBRX8TXC8: 

(A) UTOTBt 33 

(R> TTVB: nuclaic acid 

(C) STRANOBDNBSS: aingla 

(D) TOFOXMY: linaar 

iiM) FBATURB: 

<D| OTBBR XNfORNATXON: N at poaition 18 la 

Xnoalna. Y can ba eytidina 
or thyaidlna* 

(Si) SBQUBNCS DBSCRXPTXOSt SBQ XD NO: 100: 

CASRTAYTCR TAYTCTCNOC AK60ATCCTG «G 



(2> XNPORMATZON VOR SBQUBNCB XDBNTXPXCATXON NUNBBR: 101: 
(i) SBOOENCB CSARACTBRXSTXCS: 

(A) XJBNOTBt 33 

(B) TYVB: nuclaic acid 
CO STRANDBDNB8S: aiagla 

(D> TOPOLOGY: linaar 

(is) F8ATURB: 

(D) OTHER XNTORMATXCai: N at poaition 19, 25, and 

31 ia Xnoaina. Y can ba 
eytidina or tbymidina. 
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(Xi) SBOOSRCB DBSCRXPTZONt SEQ ZD NOt 101s 
OOGMTTCTO CMAMGGflG hXGCnCAYhC NGA 



(2) XHrOBmnON FOR 88Q0ENCE lOENTZPXCHTZOII KUMBERs 102 < 

(1) SSaDBNCB GHMUiCTERISTZCSs 

(A) LSBGTBs 33 

IB) mzt nuelaic aei4 

(C) smmsmnsss singio 
(0) wpouomt liMAT 

Clx) FIBSORXt 

(D) OTBBll ZNTOMATZONt N at position 3 and 18 la 

Zaoaina. Y can ba eytidina 
or thymidiaa. 

fxi) SSQUBIICB DSSaanZONs SBQ ZD NOt 102: 

ocRocyAjma cttcytthgc mggatcctg cm 



(2) ZOTORNaTZOli FOR 8BQDBNCB ZDBNTZFZCATZON NUMBERS 103 1 

(i> SB^^IBRCB CBRRACTERlSTZCSs 

<A) LBHO THg 33 

<B) TTPBi nuelaic acid 

(C) STRMiDBOHBSSs ainglo 

(D) TOFOLOQTt liaaar 

CiJC) FBATDRBs 

<D) OTHBR ZHFORMATZmt N at poaition 2, 6, 9, and 

IB ia Znoaina. T can ba 
cytidina or thymidina. 

(xi> SBaOENCB DSSCRZPTZONt SEQ ID NOt 103 1 

ccRocitaaRG cnonmiGc aagcatcctg cac 33 



(2) ZHFORMATZON FOR SBQOEIICB ZDENTZFZGATZOR NUMBBRi 104 1 
(i) SBQOENCE CBARACXBRZSTZCSt 

(A) tBHO THi 30 ' 

(B) TTPSi nuelaic acid 

(C) 8TRAI1DBD1IE8SS ainola 
(D> TOFOLOGXs linaar 

FSATORBt 

(0) OTHER ZNFORMATZOHt N at poaition 3, 12 » and 15 

ia Znoaina.r can ba 
cytidina or thymidine. 

(Xi) SBQUSNCS DBSCRZFTZCafi SEQ ZD NOt 104 1 

TCNGCRAART ANOCNGCAAO C A TCCT C CAC 30 



(2) ZNFORMATZON FOR SBWSNCB ZDBNTZFZCATZON NUHBSRt 105 1 
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(1) SBQUBKCB CBMACmZSTZeSi 

(A) LSNCTBi 38 

(B> rmt auciaie acid 

(C) STMNDSONBSSi single 

(D) TOPOLOCYi linaar 

. <xi} SBQOBHCS DSSCRZPTZONt SBQ ZD NOi 105 1 
CJITCCATCTG CAGGCTGAXT CTGGACAXXA TAT6TGCA 36 

(2) ZNTOilKATZM FOB BBOmiCB ZDBMTZFZCATZON NUMBER^ 106 1 
<i) SBQOERCB CaRRRCTBRZSTZCSs 

ih) LBRGTBs 37 

(B) TYPBt nuelaic acid 
CC) STRARDEORBSSi Bingltt 

<D> TOKLOOYt lifwar 

(Xi) SBQOBHCS OBSCRIPTZORi SBQ ZD ROs 106t 
AACGATCCTG CAGCCACATC T06A6TCGAC ATCGAtT 37 

(2) ZRFORMATZCRl FOR SBQOBRCE ZDERTZFZ CATION NUMBERt 107: 

<1) SBQDSRCS GHARACTBRISTXCSt 

|A| WGTBl 37 

|B> TEPBt nuelttle acid 

(C| STRARSEORBSSs aiagla 

(D) TOPOLOGTt llaaar 

(Xi) SBQOBRCE DESCRZPTZOMs SBQ ID NOt 107: 

C06AATTCXG CA6T6ATCAC CAAACTACGA AAT6ACA 37 

(2) ZRFORMATZOR FOR SBQOBRCE IDERTZFICATZOR NOMBER: 108: 
(i) SBQOBRCE CHARACTERZSTZCSs 

(A) UmCTB:. 37 

(B) TTPEt nuelaie acid 

(C) STRARDEDRBSSi singiB 
(0) lOPOUGy: linaar 

(xi) SBQ U BRC B DESCRXPTXORs SBQ ZD RQ: 108: 

CATG6ATCT0 CA0CCTA6TT TGCTGAtCAC TTT6CAC 37 

(2) ZRFORMATZOR FOR SBQOERCB ZDBNTZFZCATZON NUMBER: 109: 
(i) SEQOXRCS CHARACTERZSTZCSt 

(A) LERGTB: 37 

(B) TTFB: nuelttie acid 

(C) 5TRARD8DRB88: aiagU 

(D) TOFOXXXSY: llnmr 

(Xi) SBQDBRCB DESCRZPTZOMi SEQ ZD NO: 109: 
AAfiGATCCTG CAGTATATTC TCCA6AATCA CCCAGTG 37 
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(2) INFOKMATZON FOR SEQUENCE ZDENTZrZCATZON NUMBER: 110 1 

(1) SEQUENCE CHMACTERZSTZCSs 

(A) ZJnOTBi 34 

(B| TYPE I nuelaie acid 

(C) STNANDSDNBSSt singls 

(O) TOPOZMTt liaaar 

(Xi> SEQUENCE OBSCRZPTZONs SEQ ZD NOs 110 s 

AAOOATCCTO C M CC A CC CA GTA66CATCT CTTA 



(2) ZNTOSKATZW VON SEQUENCE lOENTZnCAtZON NUHBERs Ills 
(1) SE QU EN C E CHMACXERXSTZCSs 

(A) ZJtNOT Hs 35 

(B) TtFEs nuclalc acid 

(C) STMNDEDNESSs singla 
(0) tOPQLOGYs llnaar 

(Xl) SEQUENCE DESCRZPTZONs SEQ ZO NO: 111: 

CCOAATTCTG CAfiCMAACT TOCCATTACC AAAOC 



(2) ZNPORKATZON FOR SEQUENCE ZOERTZPZCATZQN NUMBERS 112s 

(1) SEQUENCE CHARACTBRZSTZCSs 

(A) lENC THs 33 

(Bl TTFEt nuclalc acid 

<C> StRAN DEDNESSs aiagla 

(0| TOMMGEs liaaar 

(xi) SEQUENCE DESCRZPTZONs SEQ ZD ROs 112: 

CATCC06GGA TQAAGAOTCA GOACTCTCIG GCA 



(2) ZNFORMATZON FOR SEQUENCE ZOENTZFZCATZON NUMBER: 113: 

(i) SEQUENCE CNARACTBlUSTZCSs 

(A) LENG TH: 39 

(8) TYPES , msclaie acid 

( C) STRANDEDNESS s aingla 

(D) TOPOLOOYs liaaar 

(Xi) SEQUENCE DESCRZPTZON: SEQ ZD NO: 113: 
ATACCCGG6C TOCAOACAAT GAGATTTCAC ACACCTOCG 



(2) ZNPORNATZON FOR SEQUENCE ZDENTZFZCATZON NUMBER: 114s 
(i) SEQUENCE CHARACTBRZSTZCSs 

(A) LBNOTBs 36 

(B) TYPE: iraelaic acid 

(C) STRANDEDNESSS ailiula 
(O) TQPQLOGTs llnaar 

(xi) SEQUENCE DESCRZPTZONs SEQ ZD NO: 114: . 
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AM^GATCCTG CMSTTTGGM CCXGCCAGJU; ACTCCT 36 

(2) ZHPOWfATXOM FOR SEQUENCE ZOENTZFZCATZON NUMBERS 115i 
<1) SEQUENCE CHMXCTERZSTXCSs 

(A) LBNCTHi 39 

(B) TTPEt nuelttlc Mid 

(C) STRMDEDNESSt ■ifigltt 
(0) T0P0UM3YS linaar 

(Xi) SEQUBNGK OESOUPtZONt ESQ ZO NOs 115 1 

MACCCGG6C T6CMIATCIIG ATRCACACA CCT CCC T G A 39 

<2) XNVOEMAnOll FOR SEVnNCE ZOSNTIFICATZON NUMBER: 116t 
(1) SBQQENGS CHMACTERZSTZCSi 

(A) LENQTBt 12 

(B) TTFE: imlno acid 
<C) STRANDBDNESS: 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DEECRZPTZONs SEQ ZD NOs 116: 

Bia Gin Val Trp Ala Ala Lya Ala Ala Gly Lau Lya 

IS 10 

(2) ZNFmMATZON FOR SEQUENCE ZDBNTZFZCATXON NUMBERi 117: 
|i) SEQUENCE CHaRACTERZSTZCS: 

(A) LENGTH: 16 

(B) TZVEi asino acid 

(C) STRANDEDNESSt 

(O) TOPOLOGTt linaar 

(Xi) SEQUENCE DESCRZPTZONt SEQ ID NOt 117: 

Gly Gly Lau Lya Lya Aap Sar Lau Lau Thr Val Arg Lau Gly Ala Aan 

IS 10 15 

(2) ZNFOBMATZON FOR SEQUENCE IDENTZnCATZON NUMBER: 118: 
Ci) aEQUEWC E CBARACTERZSTZC8: 

(A) LENGTH: 13 

(B) TtPE: amino acid 
(C> STRANDBDNESS: 

(D> TOPOLOGY: linaar 



<ix) FEATURES 
<D) OTB 

(Xi) SEQUENCE DESGRZPTXON: SEQ ZD NO: 118: 

Laa Gly Ala Trp Gly Pro Pro Ala Pha Pro Val Xaa Tyr 
15 10 

(2) ZNFtMUIATZON FOR SEQUENCE ZDENTZPZCATZON NUMBERS 119: 



(D) OTHER ZNFOBMATZON: Xaa in poaition 12 ia 

unknown. 
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(1) 8EQUSNCS CBARACTEIUSTXCSt 

(A) LBRCTBt 23 

(B) TTPBi laiao acid 

(C) STRANDBDNSSSi 

(O) TOPOLOOYt linear 

(xi) SBQOENCB OXSCRXmOirt SEQ ZD NOt 119 s 

tma Lmx Thr Val Avg Lm Oly JUa Trp Oly Hla Pro Ala Pha Pro Sar 
IS 10 15 

Cys Gly Arg Lou Lya Glu Aap 

20 

(2> ZHTOBMATZON PGR SXQU8HCE ZDENTZPZCATZON lIUKBERi 120s 

(1) 8EQUBRCB CHARACTEAZSTZCSt 

(A) LgirO THs 13 

<B) TtPXt amino aeid 

(C) STRAMDEDRBSSi 

<D) lOPOLOGTs linaar 



(ix) 

(D) OXHBR ZmORNATZ«t Baa in poaitioa 10 in 



(Xi) 88Q0BBC8 DBBCRZPTZONs SBQ ZD NOs 120$ 

Tpr Zla Pho Pha Mat Glu Pro Glu Ala Xaa Sar Sar Gly 

IS 10 

(2) XRPORMATZON FOR SBQOENCB XDEMTZPZCATZON NUMBBAt 121i 

(i) SB0OBBC8 CHARACTBRZSTZCSs 

<A) LBHO lHi 23 

<B) TYPES aoUJio acid 

<C| STBAIIDBDHBSS: 

(D) TOPOLOGTs linaar 

(xi) SBOOERCB DBSCRZPTZONs SEQ ZD NOs 121s 

X»ya Glu Aap Sar Aro Tyr Zla Pha Pha Mat Glu Pro Glu Ala Aaa Sar 

1 5 10 .15 

Sar Gly Gly Pro Gly Arg Lau 
20 

<2) ZNPOBMATZON FOR SEQUENCE ZDENTZPZCATZON NUMBERS 122s 

(i) SBQOENCB CBARACTBRZSTZCSs 

(A) ZAtGTBs 14 

CB) TYPES aaino aeid 

(C) STRANDBDMBSSs 

(O) TOPOLOGYt linaar 
(Xi) SEQDXNCB DBSOlZPnOHs SBQ ZD NOt 122 s 

Val Ala Gly Sar Lya Lau Val Jm Arg cya Glu Thr Sar Sar 

15 10 

(2) ZNPORNATZON FOR SEQUENCE ZDENTZPZCATZON NUMBERS 123s 
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(1) SBQOERCB CBMACTERXSTZCSs 

(A) tgHC THt 16 

(B) nPBt uino add 

(C) STRMfOBONBSSt 

(0) T0PQL06Y« linear 

(xi) SEQOBNCB DSSOUPTZONt SEQ ZD HO: 133: 

oiu Tyr Lys cys Lm tym Phm Lys Trp Ph« lym Vym Ma Thr val Hat 

15 10 15 



(2) ZHTOmHTZGR FOR 8BQ0BRCB ZDBmzrZCJITZON NOMBBRi 124 1 
(i) SgQUEHCB CBMIMTBiaSTZCSt 

(A) mtO THt 26 

(B) TTPBt amino acid 

(C) STRAMDSDHESS: 

(D) TDPOLOCY: liaaar 

(xi) SSQOBNCB OBSCRZPTXONt SSQ ZD HO: 124: 

Cya Glu Thr Sar Sar Glu Tyr Sar Sar L«u Lya Pha Lya Trp Pha Lya 

15 10 15 

Aan Gly Sar Glu Lau Sar Arg Lya Aan Lya 

20 25 

(2) nrcmHATzoif ran sboobhcb zdbmtxfzcatzon rombbiii i2S^t 

CD SBOMKHCB GHABACTBRZSTZCS: 

<A) LBKC THt 13 

(B) TTPB: amino acid 

(C) STRAHDBDHBSSf 

(D) TOPQUOTi linaar 

(U) FBATOBBt 

(D) OTBBR ZHrORHATZOH: Xaa in poaition 12 ia unknown. 
<xi) SBQU8HCK DBSCRZPTXOBs SBQ ZD HOi 125 1 

Lya Ala Sar Lau Ala Aap Sar Gly Glu Tyr Hat Xaa Lya 

1 S . 10 



(2) ZNTOBMATZOH FOR SBQDBNCB ZDBHTXFZCATZOH NOMBBR: 126t 
ID SBQOEHCB CBARACTERXSTZCSi 

(A) LBHGTHi 23 

(B) Tmt amino acid 

(C) STSAKDBDSBSSl 

<0) TOPOLOGYt liaaar 

|XD SBQOBRCB DSSCRZPTZONs SBQ ZD HOt 126t 

Glu Lau Arg Zla Sar Lya Ala Sar Lau Ala Aap Sar Gly Glu Tyr Hat 

1 ' S 10 15 

Cya Lya Val Zla Sar Lya Lau 

20 
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(2) ZNrORIATXOK FOR SEQUBNCZ ZDXNTXFZ CATION NOMBERg 127 1 
(1) S9QUBHCB CHAIUCTSRZSTZCSt 

(A> ZJmOTBs 12 

(B) TTPBs amino acid 

(C) STRMOSONBSSt 

(O) TOPOLOGYs liMar 

(JCi) BBQIIBNeB DBBGRZPTZONi S8Q XO NOl 127l 

Ma amx Xm Ala hmp Glu tyr Glu Tyr Mat Arg Lya 
IS 10 

<2) XllfQRMATXOil FOR SBWBHCE XDBSTXFXCATXON NtlMBBRs 128 1 

(1) saOOBBCB C8ARACTBRXSTXCSI 

(A) LBKO THi 22 

(B> nFBi amino acid 

(C) STRABOBDllBSSs 

(O) TOFOLOGys linoar 

(Xi) SBQUBIICB OBBCRIFTXONi SBQ ID NOt 128 1 

X.Gtt Arg Xla 8«r Lya Ala Sor Xm AU Aap Sar Gly Glu Tyr Na« Cya 

i 5 10 15 

Lya Val Zla Sar Lya Laa 

20 

(2> ZmORHATZOB FOR SBOOBNCB ZDBRTXFZCATZOM NOMBBRi 129s 
(i) 8BVIBMB GHARACXBiaSTICBf 



(A) LHHO THt 744 

(B) TTFBi nuclaie acid 

(C) 8TRAHDE0BBS8 t aingla 

(D) TOFOLOGYs linaar 

(xi) SBQOBNCB OBSCRXFTZOHt SBQ ZD NOs 129t 

CCX6GAG CAT CAA CTO TOG GOG GCO AAA GCC GG6 GGC TTG AAG AAG GAC TOG CT6 55 
Bia Gin Val Trp Ala Ala Lya Ala Gly Gly Lau Lya Lya Aap Sar Lou 

15 10 15 

CTC ACC GTG CGC CTG GGC GCC TG6 GGC CAC COC GCC TTC CCC TCC TGC 103 
Lau Thr Val Arg Lau Gly Ala Trp Gly Hia Fro Ala Fha Fro Sar Cya 
20 ' ' 25 30 

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151 
Gly Arg Lau Lya Glu Aap sar Arg Tyr Xla Fha Fha Mat Glu Fro Glu 
35 40 45 

GCC AAC ACC ACC GGC GGG CCC GGC CGC CTT CCC AGC CTC CTT CCC CCC 199 
Ala Aan Sar Sar Gly Gly Fro Gly Arg Lau Fro Sar Lau Lau Fro Fro 
50 55 60 

TCT OGA GAC GGG COG CAA CCT CAA GAA GGA GGT CAG COG 6GT CCT GTG 247 
Sar Arg Aap Gly Fro Glu Fro Gin Glu Gly Gly Gin Fro Gly Ala Val 
65 70 75 80 

CAA CGG TGC GCC TTG CCT CCC CGC TT AAA GAG ATG AAG ACT CAG GAG 295 
Gin Arg Cya Ala Lau Fro Fro Arg Lau Lya Glu Mat Lya Sar Gin Glu 

85. 90 95 
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TCT 616 OCX 66T TCC AAX CCA 616 CTT CGO 16C 6M ACC A6T TCT 6AA 343 
S«r Val Ala ly Smr hym Lau Val Lau Arg Cya Glu Thr Sar Smz 61u 
100 105 110 

TAC TCC TCT CTC AAG TTC AAC TGG TTC AAC AAT GCC ACT GAA TTX AGC 391 
Tyr Sar Sar Lau Lya Pha Lya Trp Pha Lya Aao Gly Sar Glu Lmu Sar 
1X5 120 125 

CCA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AG6 CCG GGG AAG 439 
Arg Lya Aaa Lya Pro Glu Aan Xla Lya Ila Gin Lya Arg Pro Gly Lya 
130 X35 140 

TCA CAA CTT OOC ATT ACC AAA 6C6 TCA CT6 OCT GAT TCT G6A GAA TAT 487 
Sar Glu Lau Arg Xla Sar Lya Ala Sar Lau Ala Aap Sar 61y Glu Tyr 
145 ISO X5S X60 

ATC TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC ACT GCC TCT GCC AAC 535 
Mat Cya Lya Val llo sar Lya Lau Gly Asn Aap Sar Ala Sar Ala Aan 

165 170 175 

ATC ACC ATT GTO 0A6 TCA AAC GGTAA6AGATGCCTAC36C0TGCT ATT 583 
Xla Thr Xla Val Glu Sar Aan Gly Lya Arg Cya Lau Lau Arg Ala Xla 
180 185 190 

TCT CAG TCT CTA AGA GGA GTG ATC AAO OTA TGT GOT CAC ACT 625 
Sar Gin Sar Lau Arg Gly Val Xla Lya Val Cya Gly Kia Thr 
195 200 205 

TOAATGACGC AGGTGTGTGA AATCTCATTG TGAACAAATA AAAATCATGA AJUOGAAAAAA 685 

AAAAAAAAAA AATOOATGTC CACTOBAfflAT 6T6GCT6CAG GT06A C TCT A GAG6ATCCC 744 



(2) XHFORNATXOIt POK SIQDBIICE X08STXPXCATX0N ROMBBIIs 130} 

<1) S8Q0SKCB CaARACTBRXSTICSs 

(A> LBBG THt 1193 

(8) TTPBs nuclaic acid 

(C) ST8A»DBDIi88Si aUgla 

(O) TOPOLOGTs liaaar 

iml) SXOOnCS DSSCRXPTXCHi S8Q XD NOt 130: 

CCTGCAO CAT CAA 6TO T66 606 600 AAA 6CC 666 GGC TT6 AAC AAG 6AC T06 CTO 55 
Hla 61a Val Trp Ala Ala Lya Ala 61y Oly Lau Lya Lya Aap Sar Lau 

15 10 15 



CTC hCC GTO COC CTO G6C CCC TGG OOC CAC CCC GCC TTC CCC TCC TCC 103 
Lau Thr Val Arg Lau Gly Ala Trp Gly Hla Pro Ala Pha Pro Sar Cya 
20 25 30 

GGG OOC CTC AAO GAG GAC AGC AGO TAC ATC TTC TTC ATC GAG CCC CAG 151 
Gly Arg Lau Lya Glu Aap Sar Arg Tyr Xla Pha Pha Hat Glu Pro Glu 
38 40 45 . 

GCC AAC AGC AGC 660 6G6 CCC GGC CCC CTT CCG AGC CTC CTT CCC CCC 199 
Ala Lya Sar Sar Gly Gly Pro Gly Arg Lau Pro Sar Lau Lau Pro Pro 
50 55 60 

TCT CGA GAC 666 COG GAA OCT CAA GAA G6A OCT CAG CCG G6T 6CT 6T6 247 
sar Arg Aap 61y Pro Glu Pro Gin Glu Oly Gly Gin Pro Gly Ala val 
65 70 75 80 
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CM OCO tec 6CC TTG CCT CCC CCC tTO AAA GAG ATC AAC ACT CAG GAG 295 
Gin Arg Cyg Ala Lmu Pro Pro Arg Imu Lys Glu Mot tya sor Gin GIh 
as 90 95 

TCT GT6 6CA GOT TCC AAA CTA GtG CTT 060 TCC GAG AGO AGX TCT GAA 343 
SOT VMl AlA Gly sor I.yo Lou v«i Lou Arg Cyo Giu Thr sor smt Glu 
100 XOS 110 

TAG TCC TCT CTC AAG TTC AAG TG6 TTC AAG AAT GGG A6T GAA TTA AGO 391 
Tyr Sar Sor Lou tyo Pho Lyo Trp Pho Lyo Aon Gly sor Glu tou 8or 
lis 120 12S 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGO COG GGG AAG 439 
Arg Lyo Aon Lyo Gly Gly Aon xlo i«yo Xlo Gin Lyo Arg Pro Gly Lyo 
130 135 140 

TCA GAA CTT OOC ATT AGC AAA GOG TCA CTG GCT GAT TCT GGA GAA TAT 487 
sor Glu LOU Arg Xlo Sor Lyo Ala Sor Lou Ala Aop Sor Gly Glu Tyr 
145 ISO ISS 160 

AXO TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT CCC TCT GOC AAC 535 
Mot cyo Lyo Val Xlo Sor Lyo Lou Gly Aan Aop Sor Ala Sor Ala Aan 

165 170 175 

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG AGA 563 
Xlo Thr Xlo Val Glu Sor Aon Ala Thr sor Thr Sor Mr Ala Gly Thr 
180 185 190 

AOC CAT CTT GTC AAG TQX OCA GAG AAG GAG AAA ACT TTC TGT GtO AAT 631 
sor Bis Lou Vol Lyo Sor Ala Glu Lyo Glu Lyo Thr Pho Cyo Vol Asa 

195 200 205 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 679 
. Gly Gly Glu Cyo Pho Mot Val Lyo Aop Lou Sor Aan Pro Sor Arg Tyr 
210 215 220 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GOO AGA TGT ACT GAG AAT 727 
Lou Cyo Lyo Cyo Gin Pro Gly Pho Thr Gly Ala Arg Cyo Thr Glu Aan 
225 230 235 240 

GTG OOC AtO AAA OZC CAA ACC CAA GAA AGT GCC CAA ATG AGT TTA CTG 775 
Val Pro Mot Lyo Val Gin Thr Gin Glu Sor Ala Gin Not sor Lou Lou 
245 250 255 

GTO ATC GCT GCC AAA ACT AOG TAAT6GCCMS CTTCTACAGT ACOTCCACTC 826 
Val Xlo Ala Ala Lyo Thr Thr 
260 

CCRTCTGTC TeXOCCXOAA TAOOOCATCT CAaTOOGTGC OGOTTTCTTG TTGC06CATC 886 

TC OC OT CA OA TT CC T CC T A O AGCTAGAT6C GTTTTAOCAO CTCTAACATT GACTGCCTCT 946 

GCCTGTOGCA TOAOAACATT AACACAAOOO ATTGTATGAC TTCCTCTGTC CGTGACTACT 1006 

GOGCTCZOAG CTAGTOQTAO GTG0GTAAG6 CTCCAGT6TT TCTOAAATTO ATCTT6AATT 1066 

ACT6T6ATAC CACATGATAO TCOCTCTCAC CCA6TGCAAT GACAATAAA6 GCCTTGAAAA 1126 

GTCAAAAAAA AAAAAAAAAA AAAAAATOGA TGT0GACT06 AGATGTGGCT GCAGGTCGAC 1186 

TCTAGA0 1153 

<2) XRfORNATXOn FOR SBOOBHCS XDXMTXTXCATXOII mmBBRt 131s 
(1) SBQOEtlCB CHARACTSRXSTXCSt 
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(A) LgKO Xat 1108 

(B> TYPE I nuelttic acid 

(C) STRAMDSDHESSt aijigle 

(D) TOPOLOGY t linttar 

(Xl) SSQOSNCB D85CRXFTX0N: SBQ ID NO: 131: 

CCTGCAG CAT CAA GTG TOG GCG GOG AAA GCC GG6 GGC TTO AAO AA6 GAC TOG CTC SS 
His Gin VAi Trp Ala Ala Lys Ala Gly Oly Lau Lys Lya Aap sar Lau 

15 10 IS 

CTC ACC GTG C6C CTO GGC GCC TOG GGC GAC CCC GCC TTC CCC TCC TOO 
ima Thr Val Arg Jm Gly Ala Trp Gly Bla Pro Ala PbQ Pro 8or Cya 
20 2S 30 

GG6 OGC CTC AAO GAG GAC AGO AGO TAC ATC TTC TTC ATG GAG CCC GAG 
Gly Arg Lou Lya GIu Aap Sar Arg Tyr Xla Pha Pho Nat Glu Pro Glu- 
38 40 45 

GCC AAC AGO A6C GGC GG6 CCC GGC OGC CTT COG AGC CTC CTT CCC CCC 

Ala Aao Sar Sar Gly Gly Pro Gly Arg Lau Pro Sar Lau Lau Pro Pro 

50 55 60 

TCT CCA GAC 006 COG GAA CCT CAA GAA GGA GGT GAG GCG GOT GOT GTG 
sar Arg Aap Oly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala val 
65 70 75 80 

CAA COG TOO GCC TTG CCT CCC OGC TTG AAA GAG ATG AAO AOT CAG GAG 
Gin Arg Cya Ala Lau Pro Pro Arg Lau Lya Glu Mat Lya Sar Gin Olu 
85 90 95 

TCT OTO OCA OOT TCC AAA OTA GTG CTT COG TOO CAG AOC ACT TCT GAA 
Sor Val Ala Gly Sar Lya Lau Val Lau Arg Cya Glu Thr Sar Sar Olu 
100 105 110 

TAC TOO TCT CTC AAO TTC AAG TOG TTC AA6 AAT GOG AGT GAA TTA AGC 
Tyr sar Sar Lau Lya Pha Lya Trp Pha Lya Aan Gly Sar Glu Lau Sar 

1X5 120 125 

OCA AAO AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGO COG GOO AAG 
Arg Lya Aan Lya Pre OXo Aan ZXa Lya XXa Gin Lya Arg Pro Pro Lya 

130 135 140 

TCA GAA CTT OGC ATT AGC AAA GOO TCA CTO GOT GAT TCT GGA GAA TAT 
Sar Glu Lau Arg Xla Sar Lya Ala Sar Lau Ala Aap Sar Gly Glu Tyr 
145 150 155 160 

ATO TOC AAA OTO ATC AGC AAA CTA GGA AAT GAC AGT 000 TCT GCC AAC 
Nat Cya Lya Val ZXa Sar Lya Lau oly Aan Aap Sar Ala Sar Ala Aan 

165 170 175 

ATC ACC ATT GTG 6AO TCA AAC GCC ACA TCC ACA TCT ACA GOT GOG ACA 
Xla Arg Xla Val OXu Sar Aan AXa Thr Sar Thr Sar Thr Ala Gly Thr 
180 185 190 

AGC CAT CTT GTC AAC TGT OCA GAG AAG GAG AAA ACT TTC TOT GTG AAT 
Sar Hla Lau Val Lya Cya Ala Glu Lya Glu Lya Thr Pha Cya Val Aan 
195 200 205 

GGA GGC GAG TCC TTC ATO GTG AAA GAC CTT TCA AAT CCC TCA ACA TAC 
Gly Gly Glu Cya Pha Mat Val Lya Aap Lau Sar Aan Pro Sar Arg Tyr 
210 215 220 

TTO TGC AAO TCC CCA AAT GAG TTT ACT GGT GAT CGC TOC CAA AAC TAC 
Lau Cya Lya Cya Pro Aan Olu Pha Thr Gly Aap Arg Cya Gin Aan Tyr 
225 230 235 240 
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CTA AW CCC A6C TTC TAC ACT ACO TCC ACT CCC TTT CTC TCT CTC CCT 77S 
V4i MAt Ala 8w Ph» Tyr 8w Thr Smx Thr Pro Pha Lm 8«r Lau Pr 
24S 250 255 

QAA TMC6CATCT CMTOGGTOC C6CTTTCTTC TTCCCCCATC TCCCCTCACA TTCCCCCTAC 838 
6Xu 

ACCTACATGC OTTTTACCAC GTCTAACATT GACTGCCTCT CCCT 6 T 0 CCA TCAOAACATT 898 

AACACAAGCG ATTCTATCAC TTCCT C T O TC OGTGACTAGT CCCCTCT O AC CTACTeOTA0 958 

CTCOCTAAOO CTCCAOTGTT TCTGAAATTG ATCTTCAATT ACTCTGAIAC 6ACAT6ATA6 1018 

TCCCTCTCAC CCK6T0AAT QACAATAAAG OCCTTGMUgi 6TCMAAAAA AAAAAAAAAA 1078 
AAAAATOOAT GT06ACT06A OATQTOGCTG 



(2) XNfOSmSZON FOR SIQOBVCB ZOSmZPXCATZON HmBBR: 132 s 

(1) SBOanCI CBMACTERXSTZCSt 

(A| tBWC THt 559 

(B) TCTSt melttic acid 

(C> STRARDBDnsSt alngXa 

(D) TOMLOOTt llOMT 



1108 



(D> OXB8R ZNFORMATZONt N la poaltiofl 214 is 

oaJcnown. 

Cxi) S8Q0SIIC8 DBSCRZPTXON: SBQ ZD NOt 132s 

AOTTTOCCCC CCCAACTT6T CGGAACTCTO GGCTOCCCOG CACGGCAGOA COGQAGOGGC 60 

GGOGCCTCCC CaCGOQAXGC C APCOCCCC C CCOAOCGTAA TCCCCTCTCC CTCCT OCC CC 120 

TGOnOCCOO C OC O AC C CAO 6CA60CSAGAG GAG06GAC0G CCCCCCOAAC OOAGGACTCC 180 

OCMO06O6C GGCMCA66A GCCACCC06C GA0N06TGC0 ACCCGGA0G6 AG06CCC6CC 240 

AOT CCC ACCT C CC COO OACC GGA0CTT6GG TC O CO O CCCT CCC0CC066C GACAGCAGAC 300 

OCXCOOCOeC AOGOOGOGCO OOOCTOGOCC COCTC CC T O O CCOC O CTCCA CTCCCCCCAC 360 

AAACTTTTCC OQAAOCOOAT CCC aM K>X;C T C O OA C C CAaAC TTCTOGOGCO TOOCCTTOGC 420 

C OOCaC C OOT CO Q C OCaiaAO OOrOCaCTTC TOCC O OO A B ATO too OAO CGC AOA 474 

Mat aar Glu Arg Arg 

1 . 5 

OAA CCC AAA GGC AAB 600 AAO GOC CGC AAO AAO GAC CCA 60C TCC COG 522 
Glu Gly Lya Gly Lya Gly Lya Gly Gly Lym Lym Asp Arg Gly Sar Gly 
10 IS 20 

AAO AAO CCC GTG CCC GCG OCT GGC GGC CCO AGC CCA G 559 
Lya Lya Pro Val Pro Ala Ala Gly Gly Pro 8ar Pro Ala 
25 30 

<2) ZNfORNATZQN PGR 8B0DERCE ZDZNTZFZCATZOH NUMBERS 133s 
(1) 8XQ0BRC8 CHMACTBRZSTZCSt 

(A) LSHGTH: 252 

(B) TYP8: nuclaic acid 
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(C) SnUOIDSONBSSs aiiigitt 

(D) TOMLOOYs linMC 

<lx) TBXTtnOBi 

(0) OTHER ZHFORmTZOlls N in position 6 could bo 

oithor A or G. 

(Jli) SBODmCB DXSCRZPTXONi 8BQ ZD KOi 133 1 

CC GAT CAR GTO TGO 000 000 AAA GCC GOG GGC TTG AAC AA6 GAC TO! 47 
Bio Gin VAl Trp Ala Ala tym AIa Gly Oly Lou Lyo Lyo Asp Sor 

15 10 15 

CXO CTC AOC GtG 06C CZO GGC GCC TGO GGC CAC COC GCC RC CCC TCC 95 
Lou Lou Thr Vol ACQ Lou Gly Ala Trp Gly Bio Pre Ala Pho Pro Sor 
20 25 30 

TGC GOO OQC CTC AAG GAG GAC AGC AGO TAC ATC TTC TTC ATG GAG CCC 143 
Cyo Gly Arg Lou Lyo Glu Asp sor Arg Tyr Xlo Fho Pho Mot Glu Pro 
35 40 45 

GAG GCC AAC AGC AGC GGC GGG CCC GGC CCC CTT CCC AGC CTC CTT CCC 191 
Glu Ala Aon Sor -Sor Gly Gly Pro Gly Arg Lou Pro sor Lou Lou Pro 
SO 55 60 

CCC TCT CGA GAC GGG CC6 GAA CCT CAA GAA GGA GOT CAG CCC GOT GCT 239 
Pro sor Arg Aop Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala 
65 70 75 • 

GTG CAA COG TGC G 2S2 
Val Gin Arg Cyo 
80 



(2) ZHPOnCATZO!! FOR SBQU8HCS ZDSNTZPZCATZOII NOMBBRs 134 i 
(i> SBQUEBCB CBARACTBRZSTXCSs 

(A) LBBG TBt 178 

(B) TTPBt nueloie acid 

(C) STBAHDBDHBSSt oiaglo 

(D) TOPGLOGri liaoar 

<xi) S8Q0BBCB DBSCRZPTZONi SBQ ZO NOi 134i 

CCT TGC CTC OCC GCT TOA AAO A6A TGA AGA CTC AGO ACT CTO TOG CAG 48 
Lou Pro Pro Arg Lou Lyo Glu Bia Lys Sar Gin Glu Sor Val Ala Gly 

15 10 15 

GTT CCA AAC TAG TGC TTC GOT GOO AGA CCA CTT CTG AAT ACT CCT CTC 96 
Sor Lyo Lou Val Lou Arg Cya Glu Thr Sar Sor Glu Tyr sor Sor Lou 
20 25 30 

TCA ACT TCA AGT GOT TCA AGA ATG CGA GTG AAT TAA GCC GAA AGA ACA 144 
Lyo Pho Lyo Trp Pho Lya Aon Gly Sor Glu Lou Sor Arg Lya Aon Lyo 
35 40 45 

AAC CAC AAA ACA TCA AGA TAC AGA AAA GGC CGG G 178 
Pro Gly Aon Xlo Lyo Zlo Gin Lyo Arg Pro Gly 

50 55 



(2) ZNPOmATZOM FOR SSgOBNCS ZDENTZPZCATZ N HUMBBRi 135 1 
(i) SEQDSNCS CHARACTERZSTZCSs 
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{K} LSRGTHt 122 

(B) mzt auelftic acid 

(C) sTMmmmsst siagu 

(D) TOPOLOGYs liiwar 

(si I SBQOUeiE OBSCRZPTZOMi 8BQ ZD NOt 13Ss 

0 MO TCA GM CTT 06C ATT AGO AAA COG TCA CT6 OCT GAT TCT OCA 46 
Lym Smr Olu Lm Arg Zl« 6mr Lys Al« S«r Lm Ala Asp Sor Cly 

15 10 IS 

OM WT ATO TCC AAA CTC ATC A6C AAA CTA GCA AAT CAC ACT OCC TCT 94 
Gltt Tyv MGt Cya Lys Val ZIb sar Lya Lau Cly Aan Aap Sar Ala 8ar 

20 25 30 

60C AAC ATC ACC ATT GTG GAG TCA AAC 0 122 
Ala Aan Zla TBr Zla Val Glu Bar Aaa Ala 
25 

(2 1 ZBrmKATZON MB SBODBBCB Z08BTZFZCATZ0N NUMBBBi 136s 
(i> BBOmiCB CBABACTBRZBTZCSi 



<A) tOGTBi 417 
(B) TCT Bi imelaie acid 

<C) BTiaiil DBPBBSSt aiagla 
(D) TOMLOGY: llnaar 

(Ki) SBOOIKS DBSCBZPTZOBt SBQ ZO NOs 136 s 

TCnAAACIA CMaOACTGT ATTTTCATGA TCATCATAOT TCTGT6AAAT ATACTTAAAC 60 

COCTTTGGTC AOATCTTGT AGO AAG TCA GAA CTT CCC ATT AGC AAA GOG 110 

Lya Sar Glu Lau Arg Zla Sar z.ya Ala 
1 5 

TCA CTG OCT GAT TCT CGA GAA TAT ATC TCC AAA CTC ATC AGC AAA CTA 158 
Sar Lau Ala Aap Sar Cly Glu Tyr Mat Cya Lya Val Zla Sar Lya Lau 
10 IS 20 25 



CGA AAT 


GAC 


ACT 


GCC 


TCT 


Gly Aan 


Aap 


Sar 


Ala 


Sar 








30 




AAG AGA 


TGC 


CTA 


CTG 


COT 


Lya Arg 


cya 


Lau 


Lau 


Arg 






45 




AAG 6TA 


TOT 


GOT 


CAC ACT 


Lya Val 


cya 


Gly 


Bia 


Thr 




60 







35 40 



50 55 



206 



254 



302 



TGAACAAATA AAAATCATGA AAGOAAAACT CTATCTTTCA AATATCTTAT CCGTCCTCCT 362 
GTAAAGCTCT TCACTCCATA AGGTGAAATA GACCTGAAAT ATATATAGAT TATTT 417 

(2> ZBTOBNATZOB FOB SBODBBCB ZDSNTZPZCATZOH NDMBBBs 137s 
(II SBQDBBCB CBMACTBBZSTZCSi 

(A) LBBO TBt 102 

(B) TTPEi nuclaic acid 

(C) STBANDBDBBSSs aingla 

(D) TOPOLOGYs llnaar 
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(xi) S2QUENCS DBSClUPTXOtI: 8EQ ID NOi 137 1 

AC ATC ACC ACT GGC AlO CCA GCC TCA ACT CAC ACA CCO TAT CTO TCT 47 
Glu Ila Thr Thr Gly Mat Pro Ala Smt Thr Glu Thr Ala Tyr val Sar 
1 5 10 15 

TCA GAG TCT CCC ATT AGA ATA TCA CTA TCA ACA GAA GGA ACA AAT ACT 95 
8«r Glu s«r Pro iXm Arg Zlo Smr val Ser Thr Glu Gly Thr Aan Thr 
20 25 30 

TCT TCA T 
Smr Smr Smr 
35 



(2) ZHrORMATXOH FOR 8BWENCB ZDBNTZPZCATZON NUMSns 138t 
(i) SBODBHCB CHAMCTBRZSTZCSt 

<A) LSRGTHt 69 

(B) TTFSs nuelaie add 

(C) STmHDBDNESSt siagla 

(D) TOPOXiOGys linaar 

(Xi) SBQOERCB OBSCRZPTZONi SBQ ZD NO: 138 1 

AAG TOC CAA CCT GGA TTG ACT GGA GCG AGA TGT ACT GAG AAT GTO CCC 48 
Lys Cys Gin Pro Gly Pho Thr Gly Ala Arg Cys Thr. Glu Aon Val Pro 
1 S 10 15 

ATC AAA GTC CAA ACC CAA GAA 
Mot Lyo Val Gin Thr Gla Glu 
20 



(2) ZHFORKATZOH FOR SBQUEHCX ZOBRTZFZCATZON NOMBER: 139s 

(i) S8Q0BHCS CKARACTERZSTZCSi 

<A) UBGTHt 60 

<B) TTPBs nucleic acid 

(C) STRABPBDNBSSs ainglo 

|D> TOPOLOGYt llBoar 

Csi) S8Q0BRCE DBSCRZPTZONs SBQ ZD NO: 139s 

AAG TGC CCA AAT 6A0 TTT ACT GOT OAT C6C TGC CAA AAC TAG 6TA ATG 
Lyo cy6 Pro Aon Glu Pho Thr Gly Aop Arg Cya Gin Aan Tyr Val Mat 

1 5 10 15 

GOC A6C TTC TAC 

Ala sor Pho Tyr 

20 

(2) ZNFORNATZON FOR SBQOBIICE XOBNTZFZCATZON NOMBBRs 140s 

(1) SBOOBNCS CHARACTERZSTZCSs 

CA) tBRGTBi 36 

<B) TTPBs nueloic add 

(C) STRANDBOllBSSs alogla 

(0) TOPOLOQTs linaar 



(xl) SBQOBNCE DBSCRZPTZONs SBQ ZD NOs 140: 
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A0T ACQ TCC ACX CCC TTT CTG TCT CTO CCT GAA TAG 36 
Smr Thr Smr The Pro Pbm Lau Smz Imu Pr GIu 
1 5 10 

(2) INFORMATION FOR SSQOBMCB lOeNTIFICATION NUMBERi 141s 

(1) $SQ08NC8 CBKRACnoaSTICSs 

CA) LBNQTHl 27 

|B) TTPBt nueloie acid 

(C) STNANDBDHSSSt •ingXa 
(Dl TOPUOGTt limar 

(si) SBQOERCB DSSCRIPTIONs SBQ ID NOt 14Is 

AAO CAT CTT 6G0 ATT GAA TTT AT6 GAG 27 
Lya Hia Lau Gly Zla Glu Pha Mat Olu 
1 S 

(2) INFORMATI(»r FOR SBQUENCE ZDZNTIFICATZON NONBSR: 142s 

(1) SEQUENCE CHARACTERISTICS: 

(A> lENC THt 569 

(B) TTPls atielaic aeld 

CO) STmNDBONBSSs aingla 

(D) TmnoOTt liaaar 

(xi) SIOOBNCl DISCRIPTZONs 88Q ID NOs 142 s 

AAA GOO GAO GAG CXC 1AC CA6 AAO AOA GTG CTO ACC ATT ACC GGC ATT 40 
Lya Ala Olu Olu Zm Tyr Oln Lya Arg Val Leu Thr Ila Thr Gly lla 

^5 10 15 

TOO ATC 606 CTO CTC OTO OTT GGC ATC ATC TOT GTG GTG OTC TAC TOO 96 
Cya Ila Ala Lau Lau Val Val Gly He Mat Cya Val Val Val Tyr Cya 

20 25 30 

AAA ACC AAO AAA CAA OGG AAA AAG CTT CAT GAC CGG CTT 000 CAO AGO 144 
Lya Thr Lya Lya Gin Arg Lya Lya Lau Hia Aap Arg Lau Arg Ola Smr 
35 40 45 

CTT COO TCT GAA AOA AAO ACC ATO A30 AAC GTA CCC AAC GOO CCC CAC 192 
Lau Arg 8ar Olu Arg Aaa Thr Mat Mat Aan val Ala Aaa Gly Pro Hia 
50 55 60 

CAC CCC AAT COG CCC CCC OAO AAC OTO CAO CTC GTG AAT CAA TAC OTA 240 
Bla Pro Aaa Pro Pro Pro Olu Aan Val Ola Lau Val Aan Ola Tyr Val 
65 70 75 80 

TCT AAA AAT CTC ATC TCT A6C GAG CAT ATT GTT GAG AOA GAG CCC GAG 288 
Sar Lya Aaa Val Ila 8ar Sar Olu Bla Ila Val Glu Arg Glu Ala Olu 
85 90 95 

AGC TCT TTT TCC ACC AGT CAC TAC ACT TOG ACA CCT CAT CAT TCC ACT 336 
Sar Sar Pha Sar Thr Sar Bla Tyr Thr Sar Thr Ala Hia Hia sar Thr 
100 105 110 

ACT ore ACT CM ACT CCC AGT CAC ACC TGO AGC AAT OGA CAC ACT GAA 384 
Thr val Thr Gin Thr Pro Sar aia sar Trp Sar Aan Gly Hia thr Glu 
115 120 125 

AGC ATC ATT TOO GAA AOC CAC TCT OTC ATC GTG ATC TCA TCC OTA GAA 432 
Sar Ila Ila Sar Glu Sar Bla Sar Val Ila Val Mat Sar sar Val Olu 

130 135 140 
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AAC AGT ACC CAC ACC AGC COG ACT GCC CGC COG AGA G6A C6T CTC AAT 480 
hmn Sar Arg HLm smx Smr Pro Thr GXy Gly Pro Arg GXy Arg Lou Aan 
145 ISO 15S 160 

GGC TTG GGA CGC CCT C6T GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA 528 
Gly Lou Gly Gly Pro Arg Glu Cya Aan Sar Pha Lau Arg Hia Ala Arg 
165 170 175 

GAA ACC CCT GAC TCC TAC OOA GAC TCT CCT CAT AGT G AAAG 569 
Glu Thr Pro Aap Sar Tyr Arg Aap Sar Pro Hia Sar 
180 185 

(2) XCTORMATZON POR SWOmCZ ZDBIITZPXCATXOR NUMBBRs 143s 
(1) SgO DK H C E CBARACmaSTXCS: 

(A) LSRGTBs 735 

(B) TTPBs nuelaie aoid 

(C) STRANDSDMBSS: aingla 
(D> TOPOLOGY I linaar 

(xi) SBQUBNCS DBSCRXPTXCNt S8Q ID NO: 143: 

G TAT GTA TCA GCA ATG ACC ACC COG GOT CGT ATG TCA CCT GTA GAT 46 
Tyr Val Sar Ala Mat Thr Thr Pro Ala Arg Hmt, Sar Pro val Aap 
1 5 10 15 

TTC CAC ACQ CCA AGC TCC OCC AAG TCA CCC CCT TOG GAA ATO TOC COG 94 
Pha Hia Thr Pro Sar Sar Pro Lya Sar Pro Pro Sar Glu Mat Sar Pro 
20 25 30 

OCC 6T0 TOO AGO AOG AOO GTC TCC ATG CCC TCC ATG GOG GTC ACT CCC 142 
Pro Val Sar Sar Thr Thr val Sar Mat Pro sar Mat Ala val Sar Pro 
35 40 45 

TTC GTC GAA GAG GAG AGA OCC CTG CTC CTT GTG ACG CCA OCA OCG CTG 190 
Pha Val Glu Glu Glu Arg Pro Lau Lau Lau Val Thr Pro Pro Arg Lau 
SO 55 60 

OGG GAG AAG TAT GAC CAC CAC GCC CAG CAA TTC AAC TOG TTC CAC TOO 238 
Arg Glu Lya Tyr Aap Hia Rla Ala Gin Gin Pha Aan Sar Pha Hia Cya 

65 70 75 

AAC CCC GOG CAT GAG AGC AAC AGC CTG 000 CCC AGC 000 TTG AGG ATA 286 
Aan Pro Ala Hia Glu Sar Aan Sar Lau Pro Pro Sar Pro Lau Arg Xla 

80 OS . 90 95 

GTG GAG GAT GAG GAA TAT GAA AOO ACC CAG GAG TAC GAA GCA OCT CAA 334 
Val Glu Aap Gin Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pre Ala Gin 

100 105 110 

GAG COG GTT AAG AAA CTC ACC AAC AGC AGC OGG CCG GCC AAA AGA ACC 382 
Glu Pro Val Lya Lya Lau Thr Aan sar sar Arg Arg Ala Lya Arg Thr 
115 120 125 

AAG CCC AAT GOT CAC ATT GCC CAC AGG TTG GAA ATG GAC AAC AAC ACA 430 
Lya Pro Aan Gly Hia Xla Ala Hia Arg Lau Glu Mat Aap Aan Aan Thr 
130 135 140 



GGC GOT GAC AGO AGT AAC TCA GAO AGC GAA ACA GAG GAT GAA AGA GTA 476 
Gly Ala Aap Sar Sar Aan Sar Glu Sar Glu Thr Glu Aap Glu Arg Val 
145 150 155 

GGA GAA GAT ACG CCT TTC CTG GCC ATA CAG AAC CCC CTG GCA CCC AGT 526 
Gly Glu Asp Thr Pro Pha Lau Ala Xla Gin Aan Pro Lau Ala Ala sar 
160 165 170 175 
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CTC GM GOG 6CC CCT GCC 1TC C6C CTO GtC GAG AGC AGO ACT AAC CCA 574 
tma Giu Ala Ala Fro Ala Phm Arg Lau Val Aap Sar Ar9 Thr Aan Pro 
180 185 190 

ACA GGC GGC TTC TCT COG CAC GAA GAA TTG CAG GCC AGG CtC TCC GCT 622 
Thr Gly Oly Pha Sar Pro Gin Glu Giu Lau Gin Ala Arg Lau Sar Gly 
195 200 205 

GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC TAAAACCGAA ATACACCCAT 672 
Val Xla Ala Aan Gin Aap Pro lla Ala Val 
210 215 

MATTCAOCT 6TAAAACTTT ATtTTATATA ATAAAGTATT CCACCTTAAA TTAAACAA 730 

(2) ZHTGRMATZQH TOR Smm»CB 2D8NTIPZCATZOR mmBBUt 144s 

(i> SBQOUCB CBAMUrTEKISTICSt 

(A) UBGtBt 1654 

(B> TTPSt aaclalc acid 

(C) SlRAHDSDSXSSt alngla 

(D) TOPnoeyi llnaar 

81Q0B9C8 OBSCRZPTZOIIt SBQ XO VOt 144 x 

AGXTTOOeCC OCCAACTTGT OOQAACTCTO GGCTOGCGOG CA6G6GAG6A GCOGAGOGGC 60 

QGCG6CT6CC C A BGCO A gOC GABC6CGGCC CGGAOCGTAA TCGCCTCTCC CTCCTCCG6C 120 

TOOOAOCGOG COQGACCOAO GCA6CGACAB GACOCCACCG CGGCCGOAAC GGAGGACTCC 160 

CCACCCGOGC GOCMCAGGA GCCACCOOCC GA6C6TGCGA CCGGG A OGCA GOGCCOGCCA 240 

GTCCCAGGTG GCCOGQACCO CAOOTTGOST COCOOCGCTC CCOOCOGGOO ACAOGAGACC 300 

CTCCOCOCCA 06C0C0G0BC GCCTCCGCCC GCTOCCTOGC CCCCCTOCAC TCCCGC6ACA 360 

AACTTTTCCC GAAGCOGATC CCABCCCTCG GACCCAAACT TGTCGCGCGT CGCCTTCGCC 420 

CCGAGCCCTC 0G06CACAGC 6TGCACTTCT CG6GGGAG ATG TC6 GAG CGC ABA 473 

Mat sar Glu Arg Arg 
1 5 

GAA GGC AAA GGC AAO 000 AAO GGC GGC AAB AAG GAC OBA GGC TCC G6G 521 
Glu Gly Lya Gly Lya Gly Lya Gly Gly Lya tya Aap Arg Gly Sar Gly 
10 IS 20 

AAC AAO CCC GTO CCC GOG GCT GGC GGC COG AGO CCA GCC TTG CCT CCC 569 

I»ya Lya Pro Val Pro Ala Ala Gly Gly Pro sar Pro Ala Lau Pro Pro 
2S 30 38 

CGC TTG AAA GAG ATC AAO ATG CAG GAG TCT 6TG CCA GOT TCC AAA CTA 617 
Arg Lau Lya Glu Hat Lya Sar Gin Glu sar Val Ala Gly Sar Lya Lau 
40 48 SO 

GTG CTT 06G TGC GAG ACC AOT TCT GAA TAC TCC TCT CTC AA6 TTC AAC 665 
Val Lau Arg Cya Glu Tlir Sar Sar Glu Tyr Sar Sar Lau Lya Pha Lya 
55 60 65 

TGG TTC AAO AAT GGO AGT GAA TTA AGC OCA AAO AAC AAA CCA CAA AAC 713 
Trp Pha Lya Aan Gly Sar Glu Lau Sar Arg Lya Aan Lya Pro Gin Asn 

70 75 80 85 

ATC AAC ATA CAG AAA AGG CCO GOG AAG TCA GAA CTT CGC ATT AGC AAA 761 
Xla Lya Xla Gin Lya Arg Pro Oly Lya sar Glu Lou Arg Xla Sar Lya 
90 95 100 
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606 TCA CTG 6CT GAT TCT GGA GAA. TAT ATO TGC AAA GTG ATC AGC AAA 809 
Ala Smr tau Ala Asp sar Gly Glu Tyr Mat Cys Lys Val Zla Sar Lys 

105 110 lis 

CTA GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC 857 
Lau Gly Asn Asp 5sr Ala Sar Ala Asa Zla Thr Zla Val Glu Sar Aan 
120 125 130 



GAG ATC AGC ACT GGC ATG CCA GCC TCA ACT OAG ACA GOG TAT GTG TCT 905 
Glu Zla Thr Thr Gly Mat Pro Ala Sar Thr Glu Thr Ala Tyr Val sar 
135 140 145 

TCA GAO TCT OCC ATT A6A ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT 953 
Sar Glu Sar Pro Zla Arg Zla Sar Val Sar Thr Glu Gly Thr Aan Thr 

150 . 185 160 165 

TCTTCATOCACATCCACATCTACAGCTGGGACAAGCCATCTT GTC AA6 1001* 
Sar Sar Sar Thr Sar Thr Sar Thr Ala Gly Thr Sar Urn Lau Val Lys 

170 175 180 

TOT GCA GAG AAO GAG AAA ACT TTC T6T GTG AAT GGA GGC GAG TGC TTC 1049 
Cys Ala Glu Lys Glu Lys Thr Pha Cys Val Asa Gly Gly Glu Cys Pha 
185 190 195 

ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC TTG TGC AAC TGC CCA 1097 
Mat Val Lya Asp Lau Sar Aan Pro Sar Arg Tyr Lau Cys Lys Cys Pro 
200 205 210 

AAT GAG TTT ACT GOT GAT OGC TGC CAA AAC TAC GTA ATG GCC AGC TTC 1145 
Aan Glu Pha Thr Gly Asp Arg Cya 61a Asn Tyr val Not Ala Sar Pha 
21S 220 225 

TAC AGT AOO TOO ACT CCC TTT CTG TCT CTG OCT GAA TAGOOGCATG 1191 
Tyr Sar Thr Sar Thr Pro Pha Lau Sar Lau Pro Glu 
230 23S 240 



CTCAGTOOGT 


60C6CTTTCT 


TGTTGCOQCA 


TCTOCCCTCA 


GATTCAACCT 


AGAGCTAGAT 


1251 


GOGTTTTACC 


AGGTCTAACA 


TTOACTGOCT 


CTGOCTGTOO 


CATGAGAACA 


TTAACACAAG 


1311 


OQATTCTATG 




TCOGTGACTA 


GT6G6CTCXG 


AGCTACTOCT 


AG6TGCGTAA 


1371 


GGCTCCAGTG 


TTTCTGAAAT 


TGATCTTGAA 


TTACT6T6AT 


AOGACATGAT 


A6TCCCTCTC 


1431 


ACCCA6TGCA 


ATOACAATAA 


A06CCTTGAA 


AAGTCTCACT 


TTTATTOAGA 


AAATAAAAAT 


1491 


C6TTCCAOGG 


GACACTCCCT 


CTTCTTTATA 


AAAT6ACCCT 


ATCCTTGAAA 


AGGAGGTGTC 


1551 


TTAA6TTGTA 


ACCAGTACAC 


ACTTGAAATG 


AT06TAA6TT 


CGCTTCQGTT 


CAOAATGTGT 


1611 




AAATAAACAO 


AATAAAAAAA 


AAAAAAAAAA 


A 




1652 



(2) ZHfORKATZOIf POR SBOOBIICS ZOBNTZPZCATZOH ROMBBRs 14Sf 
(i) SEQDBirCS CaAAACTERZSTZCSt 

(A) LSnOTBt 1140 

(B) TtPli nuelale acid 

(C) STRAIlDEDHBSSt Singla 

(D) TOPOLOGYt liaaar 

(xi) SBGOEHCB DESCRZPTZONt 8EQ ZD MOs 145 1 
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CAT CAN 6TC TOO GCG OCG AAA GCC GC6 GCC TTG AAG AAG GAC TOC CTG 48 
His Gin VAl Trp Ala Ala Lys Ala Gly Gly Lmu Lym Lya Asp S«r Lau 

15 10 15 

C9C ACC GTO CeC CTG GOC GCC TOG G6C CAC CCC GCC TTC CCC TCC T6C 96 
Lgu Thr Val hzg Lm Gly Ala Trp Gly Hia Pre Ala Pha Pro Smr Cya 
20 25 30 

GG6 06C CTC AAG GAG GAC AGC AGO TAC ATC TTC TTC AT6 GAG CCC GAG 144 
Gly Ar9 XM Lym Glu Asp 8ar Arg Tyr Zla Pha Pha Nat Glu Pro Glu 
35 40 45 

GCC AAC AGC AGO GGC G66 CCC GCC CGC CTT COG AGC CTC CTT CCC CCC 192 
Ala Aan Sar sar Gly Gly Pro Gly Arg Lau Pro 8ar Lau Lou Pro Pre 
50 55 60 

TCT CCA GAC GGG CCC GAA CCT CAA GAA GGA GGT GAG CCO GGT GCT CTG 240 
8ar Arg Aap Gly Pro Glu Pre Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 75 80 

CAA 06G TGC GCC TTG CCT CCC CGC TTG AAA GAG ATC AAG ACT CAG GAG 288 
Gin Arg cya Ala Lau Pro Pro Arg Lau Lya Glu Mat Lya Sar Gin Glu 
85 90 95 

TCT CTG CCA GGT TCC AAA CTA GTC CTT CGG TGC GAG ACC ACT TCT GAA 336 
sar Val Ala Gly Sar Lya Lau Val Lau Arg Cya Glu Thr Sar Sar Glu 
100 105 110 

TAC TCC TCT CTC AAG TTC AAG TGC TTC AAG AAT GGG ACT GAA TTA AGC 384 
Tyr Sor Sar Lau Lya Pha Lya Trp Pha Lya Aan Gly sar Glu Lau Sar 
US 120 125 

OOA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA A6G CC6 GGG AAG 432 
Arg Lya Aan Lya Pro Glu Aan Zla Lya Zla Gin Lya Arg Pro Gly Lya 
130 135 140 

TCA GAA CTT CGC ATT AGC AAA GOG TCA CTG GCT GAT TCT GGA GAA TAT 480 
Sar Glu Lau Arg Zla Sar Lya Ala Sar Lau Ala Aap Sar Gly Glu Tyr 
145 ISO 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AOT GCC TCT GCC AAC 528 
Nat cya Lya Val Zla sar Lya Lau Gly Aan Aap Sar Ala Sar Ala Aan 

165 170 175 

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG AGA 576 
Zla Thr Zla Val Glu Sar Aan Ala Thr sar Thr Sar Thr Ala Gly Thr 
180 185 190 

AOC CAT CTT GTC AAG TOT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 624 
sor Hia Lou Val Lya Cya Ala Glu Lya Glu Lya Thr Pha Cya Val Aan 
195 200 205 

GGA GGC GAG TGC TTC ATG CTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 672 
Gly Gly Glu Cya Pha Nat Val Lya Aap Lau Sar Aan Pro Sor Arg Tyr 
210 215 220 

TTG TCC AAG TCC CAA CCT GGA TTC ACT GGA GCC AGA TGT ACT GAG AAT 720 
Lau Cya Lya Cya Gin Pro Gly Pha Thr Gly Ala Arg Cya Thr Glu Aan 
225 230 235 240 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAC TTT ACT 768 
Val Pro Nat Lya Val Gin Thr Gin Glu Lya Cya Pro Aan Glu Pha Thr 
245 250 255 

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC ACT ACG TCC 816 
ly Aap Arg Cya Gin Aan Tyr Val Nat Ala Sar Pha Tyr Sar Thr sar 
260 265 270 
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ACT CCC TR CT6 TCT CTO CCT GM TACCGCATCt CMT06616C CCCTTTCTTO 870 
Thr Pro Phe Ima 8mx Lm Pro 61u 
275 280 

TXOCOOGATC TOCOCTCMa TMORCCTM MCTACAtCC GtTTXAC»6 GTCXMCUTT 910 

GACTOOCTCT GC9CTGNGCA laMAACMT AMMMGOQ ATTGTAttSM TyCCTCTCTC 990 

GBTBACncr GGGCZCXCM CXRCTOGTM 6ZG08TAM6 CTCCA01GTT TCTGJUUlTTe 1080 

AXCTXGMTT ACXGXOMAC GACIiXOASM TCOCTCTCAC OCACTOCMT GACMTMUUS 1110 

GCCTTOlUUUk CTCMU tf UUUk MMUUUUUUk 1140 



(2) XNFOmMTXm rOR S m W C Z ZDCNTZPICATKNI NUKBSRs 14€s 

(i) Sa qO BHa E CSMIMTERZSTieSi 

ih) IMCm t 1764 

(8) TTPBt nueloie aeid 

<C) STIUanDXDHSSSx siaglo 

(D) TQPOLOGys llMAT 

(xl) SBQOBSCB DSSCRZPTZOM: SBQ ZD NOs X46t 

G AAG TCA OAA CTT OGC ATT AOC AAA GOG TCA CTG CCT GAT TCT G6A GAA 49 
Lyo Sor Glu Lm hgq IXm Smr Lys Ala Sor Lou Ala Asp Sor Gly Glu 

1 $ 10 XS 

TAT AXO T6C AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC 97 
Tyr Hot Cya Lya Vol Zlo sor Lys tou Gly Asn Asp Sor Als Sor Alo 

20 25 30 

AAC ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA OCT GG6 145 
Asa Zlo Thr Zlo Val Glu Sor Asn Ala Thr Sor Thr Sor Thr Ala Gly 
35 40 45 

ACA AGC CAT CTT GTC AAG TGT GCA GAG AAC GAG AAA ACT TTC T6T OTG 193 
Thr Sor Bis Lou Val Lys Cys Ala Glu Lys Glu Lys Thr Pbo Cys Val 
SO 55 60 

AAT GGA GGC GAC TGC TTC ATC GTG AAA GAC CTT TCA AAT CCC TCA AGA 241 
Asa Gly Gly Asp Cys Pho not Val Lys Asp Lou Sor Asn Pre sor Aro 
65 70 75 80 

nC TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA 006 AGA TOT ACT GAG 289 
Tyr Lou cys Lys Cys Gla Pre Gly Pho Thr Gly Ala Arg Cys Thr Glu 
85 90 95 

AAT OTO CCC ATO AAA GTC CAA ACC CAA GAA AAA GCO GAG tM CTC TAC 337 
Asa Val Pro Not Lys Val Gin Thr Gla Glu Lys Ala Glu Glu Lou Tyr 
100 105 110 

CAG AAG AGA GTG CTC ACC ATT ACC GGC ATT TCC ATC GCG CTG CTC GTG 38S 
Gla Lyo Arg val Lou Thr Zlo Thr Gly Zlo Cys Zlo Ala Lou Lou Val 
lis 120 125 

GTT GGC ATC ATO TGT GTG GTG GTC TAC TGC AAA ACC AAG AAA CAA C6G 433 
Val Gly Zlo Not Cyo Val Val val Tyr cys Lys Thr Lys Lys Gin Arg 
130 135 140 

AAA AAG CTT CAT GAC 06G CTT CCO CAG AGC CTT CGG TCT GAA AGA AAC 481 
Lys Lys Lou Hlo Asp Are Lou Arg Gin sor Lou Arg Sor Glu Arg Asn 
145 150 155 160 
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hoc KTG A9G AAC GTA OCC AAC GGG CCC GAC CAC CCC AAT COG CCC CCC 529 
Thr Mt Mt Asa v*l Ala hmn Gly Pro Hi» His Pro Asn Pro Pro Pro 

165 170 175 



GAB AAC GTO CAG CTG GTG AAT CAA TAC GTA TCT AAA AAT GTC ATC TCT 577 
Gltt Asn Val Ola Lou Val Am 61a Tyr Val Sor Lys Asa Vsl tie Ssr 
ISO 185 190 



AOC GAG CAT ATT GTT GAG AAA GAG GGG GAB AGC TCT TTT TCC ACC AGT 625 
Sgt Glu Bis ZlG Vsl Gltt Arg Glu Ala Glu sar Sgt Pte 8mr Thr Ssr 
195 200 205 

CAC TAC ACT TOG ACA OCT CAT CAT TOC ACT ACT GTC ACT CAG ACT CCC 673 
Bis Tyr Thr Smr Thr Ala Bis Bis Ssr Thr Thr Vsl Thr Gla Thr Pro 
210 215 220 

ACT CAC AGC TOG AGC AAT GQA CAC ACT GAA ACC ATC ATT TOG CAA ACC 721 
Ssr Bis Ssr Trp Ssr Asa Gly Bis Thr Glu ssr lis lis ssr Glu Ssr 
225 230 235 240 

CAC TCT GTC ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC 769 
Bis Ssr vsl Zls Vsl Mat Ssr Ssr Vsl Glu Asa Ssr Arg His Ssr ssr 
245 250 255 

COG ACT GGG GGC COG AOA GGA OCT CTC AAT GGC TT6 CGA CGC OCT OCT 817 
Pro Thr Gly Gly Pro Arg Gly Arg Lsu Asa Gly Lsu Gly Gly Pre Arg 
260 265 270 

GAA TGT AAC AGC TTC CTC AGG CAT GCC AOA GAA ACC CCT GAC TCC TAC 865 
Gltt Cys Asa Ssr Phs Lsa Arg Bis Als Arg Gltt Thr Pro Asp Ssr Tyr 
275 280 285 

OQA GAC TCT CCT CAT AGT GAA AOA CAT AAC CTT ATA GCT GAG CTA AGG 913 
Arg Asp ssr Pro Bis Ssr Glu Arg Bis Asa Lsu lis Als Glu IM Arg 
290 295 300 

ABA AAC AAG GCC CAC ABA TCC AAA TGC ATB CAG ATC CAB CTT TCC GCA 961 
Arg Asa X.ys Als Bis Arg Ssr Lys Cys Kst Gla Zls Ola Im Ssr Als 

305 310 315 320 

ACT CAT CTT AOA GCT TCT TCC ATT CCC CAT T66 GCT TCA TTC TCT AAG 1009 
Thr Bis Lsu Arg Als Ssr Ssr Zls Pro His Trp Als Ssr Phs ssr Lys 
325 330 335 

ACC OCT TGG CCT TTA GGA AGG TAT GTA TCA GCA ATG ACC ACC COG GCT 1057 
Thr Pro Trp Pro Lsu Gly Arg Tyr Vsl Ssr Als Hmt Thr Thr Pro Als 
340 345 350 

OCT ATG TCA CCT GTA GAT TTC CAC AC6 CCA AGC TCC CCC AAC TCA CCC 1105 
Arg Mst Ssr Pro Vsl Asp Phs Bis Thr Pro Ssr Ssr Pro Lys ssr Pro 
3SS 360 365 

CCT TOG GAA ATG TCC COG CCC CTG TCC ACC AOB ACC GTC TCC ATG CCC 1153 

Pro Ssr Glu Mst Ssr Pro Pro Vsl Ssr Ssr Thr Thr Vsl Ssr Mst Pro 

370 375 380 

TCC ATG GOG CTC AGT CCC TTC CTG GAA GAG GAG AGA CCC CTG CTC CTT 1201 
ssr Mst Als Vsl Ssr Pro Phs Vsl Glu Glu Glu Arg Pro Lsu Lsu Lsu 
385 390 395 400 

GTG AOG CCA CCA CGG CTG CG6 GAG AAG TAT GAC CAC CAC GCC CM CAA 1249 
Vsl Thr Pro Pro Arg Lsu Arg Glu Lys Tyr Asp Bis Bis Als Gla Gla 
405 410 415 

TTC AAC TOG TTC CAC TGC AAC CCC GOG CAT GAG AGC AAC AGC CTG CCC 1297 
Phs Asa Ssr Phs His Cys Asa Pro Als Bis Glu ssr Asa ssr Lsu Pro 
420 425 430 
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CCC MC CCC TIG AGO ATA GTC GAG GAT GAG GAA TAT CAA AC6 ACC GAG 1345 
Pre Smr Pro Lou Arg Zlo Val Glu Asp Glu Glu Tyr Glu Thr Thr Gin 
435 440 445 

GAG TAG GAA CCA GCT CAA GAG CCC GTT AA6 AAA CTC ACC AAC ACC AGC 1393 
Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lya Lau Thr Ann Sar Sor 
450 455 460 

OGG COG GCC AAA AGA ACC AAC CCC AAT GGT CAC ATT GOC CAC AGG TTG 1441 
Aro Ara Ala Lyo Arg Thr Lya Pro Aan Gly Hia Zlo Ala His Arg Lou 
465 470 475 - 



GAA ATG GAC AAC AAC ACA G6C GCT GAC AGC ACT AAC TCA GAG AGC GAA 1489 
Glu Mot Aap Aan Aan Thr Gly Ala Aap Sor Sor Aan Sor Glu 8or Glu 
485 490 495 

ACA GAG GAT GAA AGA GtA GGA GAA GAT ACQ CCT TTC CIG GCC ATA CAG 1537 
Thr Glu Asp Glu Arg Val Gly Glu Aap Thr Pro Pho Lou Ala Zlo Gin 

500 505 510 

AAC CCC CTC GCA GCC AOT CTC GAG GCG GCC CCT GCC TTC C6C CTG GTC 1585 
Aan Pro Lm Ala Ala Sar Lou Glu Ala Ala Pro Ala Pho Arg Lou val 

S15 520 525 

GAC AGC AGG ACT AAC CCA ACA GGC G6C TTC TCT CCG CAG GAA GAA TTG 1633 
Aap Sor Arg Thr Aan Pro Thr Gly Gly Pho sor Pro Gin Glu Glu Lou 
530 535 540 

CAG GCC AGO CTC TCC GGT OTA ATC GCT AAC CAA GAC CCT ATC CCT GTC 1681 
Gin Ala Arg Lou Sor Gly Vol Zlo Ala Aan Gin Aap Pro Zlo Ala Val 
545 550 555 560 

TAAAACCCAA ATAGACCCAT AOATTCACCT GTAAAACTTT ATTTTATATA ATAAA6TATT 1741 

OCACCTTAAA TTAAACAAAA AAA 1764 

(2) ZHP08MATZ0N FOR S8QUEMCE ZDSNTZPXCATZON NUMBER: 147 1 

(i| SBQU8MCE CaARACTCRZSTZCSt 

(A) LENOTBs SO 

(8) TTPEi anino acid 

(C) STRAHDBDNBSSt 

CO) TOPGLOOYi llAoar 

(xi) SEQUENCE DESCRZPTZONs 8BQ ZD ROt 147 1 

Lyo Cyo Ala Glu Lyn Glu Lya Thr Pho Cya Val Aan Gly Gly Glu Cyo 

1 5 10 15 

Pho Mot Val Lya Aap Lou Sor Aan Pro Sor Arg Tyr Lou Cys Lya Cya 

20 25 30 

Pro Aan Glu Pho Thr Gly Aap Arg Cya Cln Aan Tyr Val Mot Ala sar 
35 40 45 

Pho Tyr 
50 

.(2)*XNPORMATZON FOR SEQUENCE ZDENTXFZCATZON NUMBERS 148t 

(1) SEQUENCE CBARACTERZSTZCSs 

(A| LENGTH: 50 

(8) TTPE: amino acid 
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{C| STRMIDZONBSSt 

(D) tOMljOGYt linmmx 

(xi> SSQUSNCB DBSCnZPTZ Nl SEQ ZO NO: 148 1 

lym Cym Ala OXu Lym Olu Lys Thr Phm Cys V«l Asn Gly Cly Glu Cy« 

^5 10 15 

Phm Hot V«I Lys Asp Lm Sor Asn Pro Sor Arg Tyr Lou Cya Lym Cya 

20 25 30 

Gla Pro Gly Phe Thr Gly Al« Arg Cy» Thr Glu hmn Val Pro Hmt Lym 
35 40 45 

VaI Gin 
50 

(3) mrORMATZOIf fOR SBQOBirCE ZSSNTXPZCATXOII NOMBERt 149i 

ID SBQDBHCB GKMUICrSIIZSTZCSi 

(A) IMO THt 46 

(B> TCTBt amino acid 

(C> STBAIIDBDlfBSSi 

(D> TOTQLOGts llnoar 

Cxi) 8BQ0BRCE OBSCRZPTZONt SBQ ZD MOt 149 < 

Olu Cya Lou Arg Lya Tyr Lya Aap Pha Cya Zla Hia Gly Glu Cya Lya 

is 10 15 

Tyr val Lya Glu Lau Arg Ala Pro Sar cya Lya cya Gla Gin Glu Tyr 
30 35 30 

Pha Gly Glu Arg Cya Gly Glu Lya sar Aan Lya Thr Bia Bar 
35 40 45 

(3) ZNTOIUIATZOB FOR SBODBNCB ZOSNTZPZCATZON NmCBBR: 150s 
(i) SBQOBNCB CRMACTERZSTZCS: 

(A) LBHG TH: 198 

(B) TtPSs nuclaic acid 
(C| STRANDSDHSSSs aiagla 

(D) TOMLOGyt liaaar 

(Xi) BBQ^IBBCB DBBCRZPTZOHt SBQ ZO NOt ISOt 

AGO CAT CTT GTC AM T6T GCA GAG AM GAG AAA ACT TTC T6T GTG AAT 48 
Sar Bia Lau val Lya Cya Ala Glu Lya Glu Lya Thr Pha Cya Val Aan 

15 10 15 

GGA GGC GAG TCC TTC ATO GTO AAA GAC CTT TCA AAT CCC TCA AGA TAC . 96 

Gly Gly Glu Cya Pha Mat Val Lya Aap Lau Sar Aan Pro Sar Aro Tyr 
20 35 30 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT OGC TGC CAA AAC TAC 144 
Lau Cya Lya Cya Pro Aan Glu Pha Thr Gly Aap Arg Cya Ola Aaa Tyr 
35 40 45 

GTA ATG GCC AGC TTC TAC ACT ACG TCC ACT CCC TTT CTG TCT CT6 CCT 193 
Val Mat Ala Sar Pha Tyr Sar Thr Sar Thr Pro Pha Lau Sar Lau Pro 
50 55 60 
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CM TAG 

Glu 

65 



(2) XHVORMIiTZON FOR SEQOZIICB ZOSIITZFZCATXON NUMBERt 151 1 

(1) SSQUEBOS CBARACTBRZSTZCSt 

ih) L8HGIBI 192 

<B) TTVIi aueioie Mid 

(C) STRMnnPflBSSt •ingla 

(D) TOKiLOGiri linaar 

(Xl) SBQQBSia OBSdUPTZON: SEQ ZD NOi 151s 

MtC CAT CTT GTC AM T6T GCh GM AAG GJUS AAA ACT TTC TOT 6TG AAT 48 
Sat Bis Xm Val Lys Cys Ala Glu Lys Glu Lys Thr PhG Cys Val Ami 

1 5 10 15 

GGA G6C GAG TGC TTC ATG GT6 AAA GAC CTT TCA AAT CCC TCA A6A TAC 96 
Gly Gly Glu cys Phm Mat Val Lys Asp Lsu Ssr Asa Pro Ssr Arg Tyr 

20 25 30 

no TGC AAG TGC CAA CCT GGA TTC ACT GGA CCO A6A T6T ACT GAG AAT 144 
Lsu Cys Lys Cys Gin Pro Gly Phs Thr Gly Aim hxq Cys Thr Glu Asn 
35 40 45 

GTO CCC ATG AAA GTC CAA AOC CAA CAA AAA GOG GAG GAG CTC XAC TAA 192 
vsl Pro Mt Lyo Vsl Gla Thr Gin Glu Lys Ala Glu Glu Lou Tyr 

SO .55 60 



PCT/US94/05083 
198 



(2) ZBTOIUCATZON FOR SBQOBBCE XDENTZFZCATZOM NUMBER: 152s 

(1) SBQUBRCX caARACTBRZSTXCSs 

(A) ZA»THt 183 

(B> TTPBt nuclsic acid 

(C) 6SRABD8DBBS8S siilGlS 

(D) TOPOtOGTt liSMr 

(Xl) SBQOERCB DESCRZPTZONi SBQ ZD NOt 152s 

AOC CAT CTT GTC AAG T6T GCA GAG AAG GAG AAA ACT TTC tOT GTC AAT 48 
Ssr His Lsu Vsl Lys Cys Als Glu Lys Glu Lys Thr Phs Cys Vsl Asn 

15 10 15 

GGA CGC GAG TGC TTC ATG GTO AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Oly Glu Cys Phs Not Vsl Lys Asp Lsu Ssr Asn Pro Ssr Arg Tyr 
20 25 30 

TTO TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 144 
LOU Cys Lys Cys Pro Asa Glu Phs Thr Gly Asp Arg Cys Gla Asa Tyr 
35 40 45 

GTA ATG GOC AOC TTC TAC AAA 606 GAG GAG CTC TAC TAA 183 
Vsl Mst Als ssr Phs Tyr Lys Als Glu Glu Lsu Tyr 
50 55 60 



(2) ZNPORMATZON FOR 8BQ0BHCB ZDBNTZFZCATZOH NUMBERS 153s 
(i) 8BQU8RCB CBARACTSRZSTZCS i 

(A) LSBGTBs 210 

(B) TTPBs auelsic seid 
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(C) STRAMDSDNESSt minqlm 

(D) TOMLOGYi llMmr 

(Xi) SBODSNCB OBSGRZPTXOHt SBQ ZO HOt 153 1 

AOC CKX CTI CTC AM TCT CCA 6AC AAC GAG AAA ACT TTC TGT CTC AAT 48 
Mr Hi* Lau Val Lym Gym Ala Clu tys Glu Lys Thr Pha Cya Val Aan 
1 5 10 15 

GGA G6C GAG TGC TTC ATG 6T6 AAA GAC CTT TCA AAT CCC TCA A6A TAC 96 
Gly Oly oiu Cya Pha Mat Val Lya Aap Lau Sar Aan Pro Sar Arg Tyr 
30 25 30 

TIG TGC AA6 TGC CCA AAT GAC TTT ACT GOT GAT COC TGC CAA AAC TAC 144 
Laa Cya Lya Cya Fro Aaa ciu Pha Thr Gly Aap Aro Cya Gla Aaa Tyr 
3S 40 45 

6TA ATG GCC A6C TTC TAC AAG CAT CTT GGG ATT CAA TTT ATG GAG AAA 192 
Val Mat Ala Sar Pha Tyr Lya Hia Lau Gly lla Glu Pha Mat Glu Lya 
50 55 60 

GOG GAG GAG CTC TAC TAA 210 

Ala Glu Glu Lau Tyr 

65 

<2) IMfCmATZON FOR S gQO gM C E ZOBNTZFICATZW tniKBERt 154s 

(1) SBQOBHCB CBAIIACTBIlZSTICSi 

(A) LlBC THi 267 

(8) TTP8t auclaic aeidi 

(C) gTBA HBgDHESS; aiagla 

<D) TOPOLOOTt liaaar 

(Mi) sm m ui a i obscrzptzons sbq zo nos 154s 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TOT GTG AAT 48 
Sar Hia Lau Val Lya Cya Alh Glu Lya Glu Lya Thr Pha Cya Val Aaa 
^ S 10 15 

GGA GGC GAG TGC TTC ATO GTG AAA CAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cya Pha Mat Val Lya Aap Lau Sar Aan Pro Sar Aro. Tyr 
20 25 30 

TTG TGC AAG TOO CAA OCT GGA TTC ACT GGA GOO AGA TGT ACT GAG AAT 144 
Lau Cya Lya Cya Ola Pro Oly Pha Thr Oly Ala Arg Cya Thr Glu Aan 
35 40 45 

GTG CCC ATO AAA GTC CAA ACC CAA CAA AAG TGC CCA AAT GAG TTT ACT 192 
Val Pro Nat Lya Val Ola Thr oin Glu Lya Cya Pro Aan Glu Pha Thr 

50 55 60 

GOT GAT C6C TGC CAA AAC TAC OTA ATO GCC AGC TTC TAC ACT ACC TCC 240 
Oly Aap Arg Cya Gin Aaa Tyr Val Mat Ala Sar Pha Tyr Sar Thr Sar 

65 70 75 80 

ACT CCC TTT CIO TCT CTG CCT GAA TAG 267 
Thr Pro Pha Lao Sar Lau Pro Glu 
86 

(2) mrORMATZON rOR SSQUBHCE ZOEMTZFZCATZON NUM8SR: 155s 
<1) SEQUENCE CHARACTERZSTZCSs 
(A) LERGTHt 252 
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(B) TYPBt nuclaic acid 

(C) SlRlumeOMBSSt singls 
CO) TOPOLOGYt liiiMr 

(Xi) SEQUENCB DBSCRZPTZONi SEQ ZO HOi ISSs 

ACC CAT CTT GTC AAC TCT CCA GAG AAC GAG AAA ACT TTC T6T GTC AAT 48 
smt His Lmu Val Lys Cys Ala GIu tys Glu Lys Thr Phe Cys v«i hmn 
15 10 15 

OGA GGC GAG TGC TTC AIG 0T6 AAA GAC CTT TCA AAT CCC TCA ACA TAC 96 
Gly Gly GIu Cys Phe Itet Val Lya Asp Lsu 8«r Asa Pre Smr hxq Tyr 
20 25 30 

TTC TGC AAO TGC CAA CCT 6GA TTC ACT GCA GCG iwSA TOT ACT 6AO AAT 144 
hmx Cys Lys Cys Gin Pro Gly Phs Thr Gly Ala Arg Cys Thr Glu Asa 
35 40 45 

0T6 CCC AT6 AAA GTC CAA ACC CAA QAA AAG TGC CCA AAT GAG TTT ACT 192 
val pro Hat Lys Val Gin Thr Gin Glu Lys Cys Pre Asn Glu Pha Thr 
50 55 60 

OCT CAT CGC TGC CAA AAC TAC GTA AJC CCC ACC TTC TAC AAA GOG GAG 240 
Gly Asp Arg Cys Gin Asn Tyr val Mat Ala Sar Pha Tyr Lys Ala Glu 
65 70 75 80 



GAG CTC TAC TAA 
Glu Lau Tyr 



252 



(2) ZnrOMIATZOII POR S8Q0SHCS ZOmTZPZCATZOn NOHBBRi 156 1 
(1) SBODBNCB CBABAGTBRZSTZCSs 

(A) USXtomt 128 

(B) TTPSi Ruclaic acid 

(C) STRARDSDNBSSs single 

(D) TOPOLOOTs linear 

(Xi) 8BQUSNCE 0B8CRZPTZ0N: 88Q ID NOt 156i 

CC ACA TCC ACA TCT ACA OCT G6G ACA AGC CAT CTT CTC AAC TCT GCA 
Thr ser Thr Ser Thr Ala Gly Thr 8ar His Leu Val Lye Cye Ala 
15 10 15 

OAOAAO0ACAAAACTTTCTCTGTOAATG6A0M6MTCCTTCATO 
Glu Lys Glu Lys Thr Phe Cye Val Aaa Gly Gly Glu Cys Phe Met Val 
20 25 ^® 

AAACACCTTTCAAATCCCTCAACATACTTGT6C 
Lye Aep Leu 8er Aea Pre Ser Arg Tyr Leu 
35 40 

(2) XMPOBHATZOH FOR SBQOBNCB ZDBNTZFXCATXOH NUMBBRt 157 1 
(i) 5EQ0BRCE CHARACTBRZSTZCSt 

(A) LBROTHs 141 

(B) TTPBt nucleic acid 

(C) STRANDEOHSSSs single 

(D) TOPOLOGY t linear 

<xi) SBOOENCB DESCRZPTZONs SEQ ZO NOt 157 1 



47 



95 



128 
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A CAT AAC CTT ATA OCT GAG CTA AGO A6A AAC AA6 GCC CAC AOA TCC 46 
BLm Asa Lgu Z1« Ala Glu hma Arg Arg Asn Lym Ala HLa Arg Smr 

i 5 10 15 

AAA TOC ATO CAB ATC CAO CTT TCC GCA ACT CAT CTT ABA 6CT TCT TCC 94 
Lys C/0 Mmt Gin Urn Gin hmu Smr Ala Ttir Hia Lmi Arg Ala Sar Sar 

20 25 30 

ATT CCCCATTGOGCTTCATTCTCTAACACCCCTTGCeCTTTAGGAAO 141 
Zla Pro Bis Trp Ala Sar Pha Sar Lya Thr Pre Trp Pro Lau Gly Arg 
35 40 45 



(2) IHTOMIATIOB PGR SEQUTOCS XDSHTXPZCATZOH KUXBSR: lS8s 
(1) SBQOSNCS CHARACTBUSTZCSt 

(A) LBHO THi 24 

(B) TtPB: amino acid 

(C) STWOfOSOllSSSs 

(D) TOPQLOOYi linaar 

(ix) PBATOREi 

(O) OTBn ZRPQRMATZORi Xaa in poaitions IS and 22 

ia unknown* 

(Xi> s aOTO HC l DSSCRZPTXORi SIQ ZD NOs 158 s 

Ala Ala Glu Lya Glu X»ys Thr Pha Cya Val Aan Gly Gly Glu Xaa Pha 
is 10 15 

Hat Val Lya Asp tmu Xaa Aan Pro 

20 



(2) HIPORMATZOH PGR SEQ^CE ZDBHTZPZCATZON miMBERt 159 1 
(i) SBQOBNCB CBARACTERZSTZCSi 

(A) LBHGTBi 745 

(B) TTPBs nuelaic add 

(C) sntANDSDllBSSs singla 

(D) TOPOLOGTt linaar 

(Xi) 58Q0XRCB OBSCRZPTZONt 8SQ ZD NOt 159 1 

ATO AOA TOG 08A 06C GCC COG CGC 06C TOC G6G 06T COC GGC COC 06G 48 
Mat Arg Trp Arg Arg Ala Pro Arg Arg Sar Gly Arg Pro Gly Pro Arg 

I 5 10 15 

GCC CAB COC CCC GGC TCC GCC GCC COC TCG TOG COG COB CTG CCG CZG 96 
Ala Gin Arg Pro Gly Sar Ala Ala Arg Sar Sar Pro Pro Lau Pro Lau 
20 25 30 

CTG CCA CTA CTG CTG CTG CTG GGG AOC GCG GCC CTG GCG CCG GGG GOB 144 
Lau Pro Lao Imu Lau Lau Lau Gly Thr Ala Ala Lau Ala Pro Gly Ala 
35 40 45 

GCG GGC GGC AAC GAG GCB OCT COC GOG GGG GCC TOG GT6 TGC TAC TOG 192 
Ala Ala Gly Aan Glu Ala Ala Pro Ala Gly Ala Sar Val Cya Tyr Sar 
5P 55 60 

TOC COG CCC AGO GTG GGA TOG GTC CAG GAG CTA GCT CAG CGC GCC GCB 240 
Sar Pro Pro Sar Val Gly Sar Val Gin Glu Lau Ala Gin Arg Ala Ala 
65 70 75 80 
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OTO G7G ATC GAG GGA AA6 6TC CAC CC6 CXG CGG OGG CAO CAO GGO GCA 288 

VAX VAi llm Glu Gly hym VaX His Fro Gin hxq Arg oln Gin Gly Ala 
8$ 90 95 

etc GAC AGO AAO GOO GCG GCO GOG GOG GGO GAG GCA GGG GOG TG6 GGC 336 
Lmu Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly 
100 105 110 

GGC GAT OGC GAB 0C6 CCA 6CC 606 660 OCA 066 606 CT6 666 COO 000 384 
61y ASP Arg 61tt Pre Pro Ala Ala 61y Pro Arg Ala Lau Gly Pro Pro 

lis 120 US 

6C0 GAO GAC 006 CT6 OTO 600 600 AAO 666 ACC GTG 000 TOT TG6 COO 432 
Ala Olu Glu Pre Leu Leu Ala Ala Aon 61y Thr val Pre 8ar Trp Pro 
130 135 140 

ACO 6CC OC6 6T6 000 ACC 600 660 6A6 COO 666 6A6 6AB 600 COO TAT 480 
Thr Ala Pre Val Pre sor Ala 61y 61u Pro 61y 61u 61u Ala Pro Tyr 
145 ISO 155 160 

CT6 GT6 AAO 6T6 CAC CAO 6T6 TG6 606 6X6 AAA 600 666 660 TT6 AAG 528 
Leu Val Lya Val Hla 61n Val Trp Ala Val Lya Ala Gly 61y Im Lya 

165 170 175 

AAG 6AC T06 OTO OTO ACC 6T6 060 0X6 666 ACO T66 660 CAC CCC 600 576 
Lya Aap ser Leu Leu Thr Val Arg Leu 61y Thr Trp 61y aio Pro Ala 
180 185 190 

TTC CCC TCO Tec 666 A66 OTO AAG 6AC eAC A60 AGG TAC AXC TTC TTC 624 
Phe Pre Ser Cye Gly Arg Leu Lya Glu Aap Ser Arg Tyr Zle Phe Pha 
19S 200 205 

AXO 6A6 CCC 6AC 600 AAO A60 ACO A6C 060 GCG 006 6CC 600 TTC 06A 672 
Net Glu Pre Aap Ala Aaa Ser Thr Ser Arg Ala Pro Ala Ala Phe Arg 
210 2X5 220 

600 TOT TTC CCC OCT CTO OAO ACO G6C CGG AAO OTO AAG AAO GAG GTC 720 
AXa Ser Phe Pro Pro Leu Olu Thr 6ly Arg Aan Lau Lya Lya 61u Val 
225 230 235 240 

AGO CGG GT6 CT6 TCO AAO COG TGC 6 745 
Ser Arg Val Leu Cya Lys Arg Cya 
245 

(21 ZnPORNATZOH FOR SBQDBHOB ZOBRTZPZOATZM NOMBBR: 160 1 

(1) SBOones caAAACXSiizsTZcst 

(A) LBBOTBt 12 

(8) TXPBs amino acid 

(0) STRAROBOMBSS t 

(0) TOPOLOGYx linear 

(ix) PBATOREi 

(0> OTHBR ZNfORNATZONs Xaa in poaition 1 is 

unknown* 

(Xi) SBQOBHCB DSSCAZPTZONi 8BQ ID NOt X60t 

Xaa Ala Leu Ala Ala Ala 61y Tyr Asp Val Glu Lya 

IS xo 

(2) ZHFORMATZOH F R SSQUEHCE IDERTZPXCATION NUKBBRt X61i 
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(1) 5BQUSNCS OUUIACTBRXSTXCS: 

<A) LXHOTHs S 

(B) TTVBi amino acid 

(C) SXmilDEDllBSSi 

(0) TOPOLOGri linaar 

(ix) nhtmBt 

(D) 09HBR ZfrramHTZOHt Xaa in position 1 ia 

unknown. 

(Si) swaama OBsaaprxoNt sbq zo hoi 161: 

Xaa Lau VaX tma Arg 

1 5 



(2) ZHfORIATZOil FOR BBODBNCB ZOBRTXTZCATZON NOHBBRs 162s 
(i) BBQUBRCB CBMACTBRZBTZCSt 

(A) LBWC TH: 11 

(B) 7TPB: amino acid 

(C) STRMlDBOIIBSSs 

(0) TOMLOOYs linaar 

(in) FBAXUKBt 

(0) oraSR ZHFmiKliTZOHs Xaa in positions 1, 2, and 

3 is unknoim. 

(xi) SBQUBBCE OBSCRXPTZORt SBQ ID NOt 162 1 

Xaa Xaa Xaa Tyr Pro Gly Oln Zla Thr Sor Asn 

IS 10 . 



(2) ZNPORMATZON FOR S8Q08RCS XOBRTZFZCATZOff NUMBER: 163: 

(i) SBQUBRCE CHRRACTBRXSTZCSt 

(A) LBMOTHt 60 

(B> TtMt nuclsic aeid 

(C) STRMTOBDRBSSt singls 

(0) TOFnoQYs linaar 

(ix) FBRTORBt 

(D) OZBBR ZRFORMATZONt ff in positions 25 and 36 is 

unknoim. 

(xi) SBOOSRCS DBSCRZPTZORt SBQ ZO ROt 163 s 
RTMSGQRMO OOfSGGGORRO GGTCMCCCTC RGCRGG6C0G G6CTTGCCTC TGGRGCCTCt 60 



(2> ZRFORMRTION FOR SBOOBNCB ZOBRTZFZCRTZQN NUMBERS 164t 
(i) S8Q0BRCB CRRRRCTERZSrZCSs 

(R) tEHOTBs 18 

(B) TTPBs nuclaic aeid 

(C) STRRNDEDNBSSs sin9lo 

(D) TOPOLOGlTt linaar 

(ix) PSRTORBs 
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<P) OTHER XNFORMHTSONi N in position 16 is 

unknom. 

<xi) SBQOBRCB DESCRXPTZOHx SSQ 10 NO: 164: 
TTTACACATA TATTCHCC 1« 

(2) ZNFOIIMATION FOR SBQaBBCB ZDBRTZrXCATZOH NOMBCRi 16St 
(1) SSQOENCE CHMACTBRXSTZCSt 

ih) LSHCTBi 21 

(B) rmi amino acid 

(C) STRARDSDRSSSs 

(D) TOMLOOYi linoar 

(Xl) SEOmRCS DESCRZPTIOHt SEQ ID RO: 16St 

Glu Thr Gin Pro Asp Pro Gly Oln Zlo Lou Lys Lys Val Pro Mat Val 

15 10 15 

Zle Gly Ala Tyr Thr 
20 

(2) ZRVORMATZOH FOR SBQDBRCE ZDSRTZrZCATZON RUHRBR: 166: 
(i) 8E Q0 E R C E GRRRACTBRXSTZCSt 

(A) LRHGTBt 422 

(B) TTPB: amino acid 

(C) STRARDEDRBS8: 

(D) TOPOLOGT: linaar 

(xi) SXQtJERCB OSSCRZPTZOR: SEQ ID RO: 166: 

Mat Ar9 Trp Ar? Arg AU Fro Arg Arg Sar Gly Arg Pro Gly Pro Arg 
15 10 15 

Ala Gin Arg Pro Gly Sar Ala Ala Arg Sar Sar Fro Pro Lau Pro Lau 

20 25 30 

Lau Pro Lau Lau Lau Lau Lau Gly Thr Ala Ala Lau Ala Fro Gly Ala 
35 40 45 

Ala Ala Gly Aan Glu Ala Ala Fro Ala Gly Ala Sar Val Cys Tyr sar 
50 55 60 

Sar Fro Fro Sar Val Gly Sar Vml Gin Glu Lau Ala Gin Arg Ala Ala 

65 70 75 • 80 

val Val Zla Glu Gly Lya Val His Fro Gin Arg Arg Gin Gin Gly Ala 
8S 90 95 

IM Asp Arg Lya Ala Ala Ala Ala Ala oly Glu Ala Gly Ala Trp Gly 

100 105 110 

Gly Asp Arg Glu Fro Fro Ala Ala Gly Fro Arg Ala Lau Gly Fro Fro 
115 120 125 

Ala Glu Glu Fro Lau Lau Ala Ala Asn Gly Thr val Fro Sar Trp Fro 
130 135 140 

Thr Ala Fro Val Fro sar Ala Oly Glu Fro Gly Glu Glu Ala Fro Tyr 
145 150 155 160 
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Utt VAl hym Val Bis OXn Val Trp Ala VaX Lya Ala CXy Gly Mu Lya 

165 170 175 

Lya Asp 8mr Lau Lau Thr Val Arg Lau Gly Thr Trp Gly Hia Fro Ala 

180 IBS 190 

Pha Pro Sar Cya Gly Arg Lau Lya Glu Aap sar Xrg Tyr Ila Pha Pha 

195 200 205 

Nat Glu Pro Aap Ala Aaa Sar Thr Sar Arg Ala Pro Ala Ala Pha Arg 

210 2XS 220 

Ala sar PHa Pro Pro Xm Glu Thr Gly Arg Asn Lau Lya Lys Glu Val 
225 230 235 240 

Sar Arg Val Lau Cys Lya Arg Cys Ala Lau Pro Pro Gla Lau Lya Glu 
245 250 255 

Nat Lya Sar Gin Glu Sar Ala Ala oly Sar Lya Lau Val Lau Arg Cys 

260 265 270 

Glu Thr sar Sar Glu Tyr Sar sar Lau Arg Pha Lys Trp Pha Lys Asn 
275 280 285 

Gly Asn Glu Lau Asn Arg Lya Asn Lys Pre Gin Asn Zla Lys Zla Gin 
290 295 300 

Lys Lys Pro Gly Lys Sar Glu Lau Arg Zla Asn Lys Ala Sar Lau Ala 
305 310 315 320 

Asp sar Gly Glu Tyr Mat Cys Lys Val Zla Sar Lya Lau Gly Aaa Aap 

325 330 335 

Sar Ala Sar Ala Asa Zla Thr Zla val Glu Sar Asn Ala Thr Sar Thr 

340 345 350 

Sar Thr Thr Gly Thr Sar Hla Lau Val Lys Cys Ala Glu Lys Glu Lys 
355 360 365 

Thr Pha cya Val Asn Gly Gly Glu Cys Pha Mat Val Lys Asp Lau Sar 

370 375 380 

Aaa Pro Sar Arg Tyr Lau Cya Lys Cys Pro Asn Glu Pha Thr Gly Asp 
385 390 395 400 

Arg cya Gin Asa Tyr Val Mat Ala Sar Pha Tyr Sar Thr Sar Thr Pro 

Pha Lau Sar Lau Pro Glu 
420 



410 415 



(2) ZMrORMAnOH FOR SSQOBMCe XDIHTZFZCATZON NDMBSRi 167t 

(1) SEQUBHCB CBMIACTEMZSTZCSl 

CA> LBHQTHt 69 

(B> rmi amino acid 

(C) STmilDBDRKSSt 

. (D) TOPOLOGTi liaaar 
<Si) SWaOmCB OSSCRZPTZONi S8Q ZD ROt 167 s 

Nat Sar Glu Arg Lys Glu Gly Arg Gly Lya Gly Lya Gly Lya Lya Lya 

15 10 15 
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Olu Arg 6Xy 8«r Qly Lym Lys Pr Olu S#r Ala Ala Gly 8«r Gin SGr 

20 25 30 

Pro Arg Glu Zla if Thr Oly Hat Pre Ala sar Thr Glu Gly Ala Tyr 
3S 40 4S 

Val Sar Sar Glu Sar Pro Zla Arg Zla Sar VaX sar Thr Glu Gly Ala 

SO SS 60 

Aan Thr Sar Sar Sar 

65 

(2) nfORMATlON POR SSQDBHCS ZDSRZFZCATZON NUMBms 1681 

(1) S8Q0BRCB CHARACIBRZSTZCSl 

|A| LSHGTBs 19 

(B> TTPSt aaloo acid 

(C) STRAHDSDRBSSt 

(D) TOPOLOcy: linaar 

(xl) SBQOERCS DSSCIUPTZOlls SEQ ZD NO: 168: 

Arg Lya Gly Aap Val Pro Gly Pro Arg VaX Lya Sar Sar Arg Sar Thr 

15 10 IS 

Thr Thr Ala 

(2> ZmORIIATZOII POR SBOmiCB ZOBHTZPZCATZOli NOKBSRi 169: 
(1) SBQUBHCS CSAMCTBRXSTZeSt 

{h) LSHGTBt 231 

(B) TTPBt nuclaie acid 

(C) STBAMDIOHBSSt aingla 

(D) TOPOLOOTi llnaar 

(xi) SBQOBBGS DBSOUPTZONs SBQ 20 NOt 169 1 

0606AG0GGC TCAGOOCGGC 06CT06CTCT CCCCC TCGAG CGACAAACTT TTCCCAAACC 60 

CGATC06AGC CCTTGGACCA AACTOGCCIG COCCOAGAGC CGTC0G06TA GAOGGCTCOC 120 

TCTCOGGCGA GATGTCC6AG COCAAAOAAO GCMAGGCIJi ACGO A AGCC C AA PA A O AACG 180 

A60GA60CTC OOGCAABAAG CGOGMBTCOG CCCCCGGCAO CCAC A CCCCA G 231 

(2) ZNPORMATZON POR SEQOBNCX ZD8NTZPZCATZ0N NUKBERt 170i 
CD SBQOBNCS CBARACTBRZSTZCSt 

<A| ZnOTBt 178 

(B) TTVBi mselaic add 

(C) STRASDBDNBSSs aiagla 
|D) TOPOLOGYs linaar 

Cxi) SEQUENCE DBSCRZPTZONt SBQ ZD NOi 170: 

CCTTCCCTCC C06ATTCAAA GACATGAAAA CCCACCAATC CCCTGCAGGT TCCXAACTAG 60 

TCCTTOGGTG TCAAACCACT TCTOAATACT CCTCTCTCAC ATTCAA0T6C TTCAAOAATC 120 

GGAATQAATT 6AAT0CAAAA AACAAACCAC AAAATATCAA GATACAAAAA AA6CCACG 178 
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(2) ZRFOmUiXZQM MA SBQOSHCB ZOBHTZPXCJITZON NOMB»s 171t; 

(i) SEQ0ENCE CHMACTSRZSTZCS: 

(A) LSNGTBc 122 

<B) TTTBt nucleic acid 

(C) STBAIIDBDRBSSt siogl* 

(D) TQPQLOOYt linaar 

<Jii) 8S0DKKCB OSSCRZPTZOIIt SEQ ZD NOt 171 1 

GAAGTCMAA CTTOQCMXIi AOUIMCATC ACIGGCTGAT TCTCGJUMCT ATATCTCCAA 60 

MTGMCMC AMTTMOM AIGACMTGC CTCIGCCMT ATCACCATCG TGCAATCAAA • 120 

CO 122 

(2) ZNraRMATZOll FOR S8QI1ENCE XDBNTZFZCATZON NUMBERS 172s 
(i) SBQOBirCB CRMACXERZSTZCSs 

(A) LKHOTHt 102 

(B) TYPEt nuclaic acid 

(C) STRABOEDNBSS: Blngle 

(D) TOPOLOGY: limar 

<si) SSQUnCS DBSCRZPTZOHi SEQ ZO NOt 172: 
AOATCATCAC TGGTAIGOCA COCTCAACTG AAfiGACCATA TGTGTCRCA GAOTCTCCCA 60 
TXAfiAATATC AOTATCCaCA CAACOACCAA ATACTTCTTC AT 102 

(2) ZNYOBMATXON FOR SBQOBNCB XDENTZFZCATZW NOMBERl 172s 
<i) SB^^IBNCB CBARACTSRZSTZC8: 

<A) tENO THt 128 

(B) TYVBs ttuclaic add 

(C) ST R A W DEBW ESSi sisiffla 

(D) TOPOiiOOYs liiiBar 

(Si) S E glE WC E DBSCRZFTZONs SEQ ZD NOi 173s 

CXACATCTAC AYCCACCACT QGOACAACCC ATCTTGTAAA AT6T0CGGA6 AMOAOAAAA 60 

CRTCTQTGT CAATCQAO C Q GAGT6CTTCA TG6T6AAAGA CCTTTCAAAC CCCTOGAGAT 120 

ACTTOTGC 128 

(2) ZNFORNATZON FOR SaQOBNCE ZDENTZFZCATZON NONBBRs 174t 
fi) 8BQDBNC8 CBARACTBRZSTZCSi 

(A) LBRC THs 69 

(B) TYPES nucleic acid 

(C) STRANDEDNBSS: ain^le 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRZPTZONs SEQ ZD NOs 174 S 

AA0TGCCAAC CTOGATTCAC TGGACCAACA TGTACTGA6A ATGT6CCCAT 6AAAGTCCAA 60 

AACCAA6AA 89 
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(2) zmowiMzoN nm ssgosncB zoBmzFzaiTxoit numberi 17s> 

(i) SSQUENCB CHMJiCTERZSTZCSt 

(A) LSBGTBs 60 

(B) TYPBs nuelttie acid 

(C) STmNOBDNBSSs fllngls 
(0) TOPOLOGY I linear 

SXQOSflCB DBSCRZPTZOHt SBQ ZD ROt X7S: 

MfiTQCOCM ATQAGTTTAC TCQMATOGC tCCCMAACT AC6TM1G6C CAC C TT C T A C 60 

(2) XnrCRKATZOH FOR SBQOSirCB ZOEKTXrZaTZOir mnCBER; 176s 

(1) SBQCTEVCB CHAJUCTEItlSTZGS s 

(A) ZnOTBt 36 

CB) TYPSt nucleic add 

(C) STRAHDSDNBSSs Bingltt 

(D) TOPOLOOTt llMar 

(xi) SBQOEHCB DBSCRZPTZONt SBQ ZD NOi 176: 
A0TA06TCCA CTCCCTRCT GTCTCT60CT GAATAG 3( 

C2). ZHPOBMMZOH FOR SBQDBRCB XDBMTZFZCATZOB NUMBBRi 177 s 

(1) 8BQUBMUB CBARACTSRZSTZCS t 

(A) LZBOTBx S69 

CB) TTVBs nuelaic acid 

(C) StRAHDPHBBSs single 

(D) TOPOXjOGT: linear 

<xi) S8Q0BBG8 DBSdRXPTZMs SBQ 10 HOs 177: 

AAG60CGAG0 AGCTOncCA OAAO A OAOTO CTGACCATAA C006CATCTG CAT06CCCTC 60 

CTTGtGGTOG GCATGATGtG TGIGGTGGCC TACXCCAAAA CCAAOAAACA COCGAAAAAG 120 

CTBCAVGACC GTCTTC60CA GAGCCROGG TGT6AA0GAA ACAATAWAT QAAGATICCC 180 

AAltSGGCCTC AC CA ICCtAA OOCACCCOCC 0ACAAX6TCC AGCTGGtGAA tCAATAOQTA 240 

TCIAAAAA06 TC A TCT CC AC TQACCATAn GTKACAGAB AABCAGA6AC ATOCTTTTOC 300 

AOCASTCACr AlACRCCAC AGCCCATCAC TCCACXACZO TCACCCA6AC TCCTAOCCAC 360 

A6CTG6A6CA A C C O A C A CA C TQAAACCAf C CTTTCCCAAA GCCACTCT6T AATCCT6ATC 420 

TCAXCOGTAO AAAACA8XAG GCACA6CA0C OCAACTOGGG 6CCCAAGA06 AOGTCttAAT 480 

GGCACAGGAC CCC C TC C TGA ATCTAACACC TTCCMACCC ATOCCAGAGA AACCCCTCAT S40 

TCCTA CO CAC ACT C TCC TC A TA6TGAAA0 S69 

C2) ZRFORMATZOR FOR SBQOBKCE ZDBBTZFZCATZON NtlMBBRt 178t 

(i) 8BQ0BBCB CHARACTBRZSTZCSi 

(A) LENGTBs 730 

<B) TTPEt nucleic acid 

<C) STRAlfDBDHBSSs aiagle 
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(D) tOPOLOOYs linear 
<icl) S8QUENCB DESCRZPTZOMt SSQ ZD NOi 178 1 

GTATCTGTCA GCCATCACCA CCCCGGCTCG TATGTCUCCT GTAGATTTCC ACACGCCAAG 40 

CTC CCC CJUUi TCQCCOCCTT 066AAJIT6TC TCCACCCGTG T0CA6CA1GA OQOTGTCCAT 120 

CCCTI CCM O 606GTCMCC OCRCaiTCGA AGAAOAGAGA CCTCTACTTC TC C T C ACACC 180 

ACCAA6GCI6 OOGGAfiAAOA kOmOkCCh TCAOCCTCAG CAGTTCAfiCT CCTTCCACCA 240 

CAA CC CCCOO CAT6ACABTA ACAGCCTCCC TGCTACCCCC TTCAGGATM TGGAGGATGA 300 

GGACTAMAA AOGACCCAAG ACZAOGACCC ACCCCAAGAG OCTGTXAAGA AACT06CCAA 380 

nCCOGGOGC CCeAA A AOAA OCAAGCCCAA TGGCCACATT GCTAACAGAX TGGAAGtGGA 420 

CACCMCACA AGCtCeCAGA GCAGTAACTC A6AGAGTGAA ACAGAAGA16 AAAGAGTACG 480 

tOAAOATAOO CCTXTCCXGG OCATAGAlBAA CO C CC TC C CA GOCA GIC tT G ACGCAACACC 540 

TGOCTTCOGC CT66CTGACA 6CA66ACTAA CCCAGGAGCt CGCTTCT06A CAGAGGAAGA 600 

AAT CC AOOCC AGGCTGTCTA GtGTAAnCC TAACCAAGAC CCTAR6CXG TATAAAACCT 660 

AAATAAACAC ASAGARCAC CT6TAAAACT TTARTTATA TAA7AAAGTA TTCCACCTTA 720 

AATTAAACAA 730 

(2) unramATZOH rm sbqobkcb zobntzfzcatzon inmBSRt 179 1 

<i> SBQOBBCB CBARACTSRZSTZCS: 

<A) LRBO THi 23 

(B) TTRs nuelele acid 

(C) sniAIIDSDllSSSs aingls 
<0) TOPOloats linMT 

(xi) StOaiRCB DBSCRZPTZONs SBQ ZD NOt 179 1 

TCGGGCTCCA TGAAGAAGAT GTA 23 

(2) ZNrOJMATZOli FOR SEQUENCE ZOBNTZFZCATZOM NOHBERs 180i 
(i) SBQOENCB CBARACTEiaSTZCSt 

(A) LESOTBt 23 

(B) TTPBi nuclGic «eid . 

(C) STHAMDEDNESSs singl* 
(D> tOMLOGYs linfl 



Cxl) SBaOBNCB DBSCRZnZONt SBQ ZD MOt 180i 
TCCATOAAGA AGATGTACCT GCT 23 

<2) ZNFOBHATZON FOR 8 EQ UEMCE ZOBNTZFZCATZON NUNBERi 181 1 
(i) 88Q0ENC8 CBARACTSRZSTZCS s 

|A> LSBGTBi 22 

(B) TTFBs nuelaie moid 

(C) STRANDEDNBSSs Singltt 
(0) TOPOLOGY t linear 
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Cxi) SEgomCB OESOIXPTXOlIt SBQ ZD NOt 181 1 
ATGTACCTGC rGTCCT CC 'lT M 22 

(2) XNTORMJITZON FOR SEQUSHCE IDENTZFZOITZON NUMBERS 182: 
(i) SEQUSHCE CSUUiCTERZSTXCSi 

(A) XJBHG THt 22 

(B) mst nuclBie acid 

(C) STRMIDEDIlBSSi Blngle 
<D) TOMLOaEi Xiuar 

(Xl> SBOnBRCB DBSaaPTZONt SBQ ZD NOs 182 s 

TTGMGMCG ACTCGCTGCT Ch 22 

(2) XBTORMXTZON FOR SBQUBKCE XDSmXFXCATXQN NOKBERs 182 1 
(1) SBQDBBCB GBARACXERXSTXCS i 

(A) LEHGTB: 20 

(B) TYPES nocl0ie acid 

(C) STRANDEDNESS: singla 

(D) TOFOZXXnrs linMr 

(Si) SEQaBRCB DESCRXPTZOMs SBQ ZD NOs 183 > 
AAACCCOCCG GCTTGAAOAA 20 

(2) INFORMATZON FOR SEQUENCE ZOENTZFZCATXON NUMBERS 184s 
|i) SEQUENCE CHARACTBRZSTZCSs 

(A) LENGTHS 20 

(B) TYPES nuelBie acid 

(C) STRANDEDNESS: singla 
(0) TOPOZMYs limar 

(Xi) SEQUENCE DBSCRZPTXONs SEQ ZD NOs 184 s 

ATGARCTGT6 GG06600AAA 20 

(2) INFORMATION FOR SEQUENCE ZOENTZFZCATXON NUMBERS 185s 
(i) SEQUENCE CHARACTSRXSTXC8S 

(A) LENOTBs 422 

(B) TYPES amino acid 

(C) STRANDEDNESS s 

(D) TOPOZMYs limar 

(Xi) SEQUENCE DBSCRXPTXONs SBQ ZD NOi 185: 

Mat Arg Trp Arg Arg Ala Pro Arg Arg Sar Cly Arg Pro Gly Pro Arg 
I S 10 15 

Ala Cln Arg Pro Gly Sar Ala Ala Arg Sar Sar Pre Pro Lau Pro Lau 

20 25 30 

LOU Pro Lau Lou Lau Lau Lau Gly Thr Ala Ala Lau Ala Pr Gly Ala 
35 40 45 
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Ala Ala Gly Asn Olu Ala Ala »r Ala Gly Ala Sar Val Cya Tyr sar 

SO 55 60 

Sar Pro Pro sar Val Gly Sar Val Gin Glu Lau Ala Gla Arg Ala Ala 

65 70 75 80 

Val Val Zla Olu Gly Lya Val Hia Pro Gin Arg Arg Gin Gin Gly Ala 
85 90 95 

Latt Aap Arg Lya Ala Ala Ala AXa Ala Gly Glu Ala Gly Ala Trp Gly 

100 105 110 

Gly Aap Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Lau Gly Pro Pro 

115 120 125 

Ala Glu Glu Pro Uu Lau Ala Ala Aan Gly Thr Val Pro sar Trp Pro 
130 135 140 

Thr Ala Pro Val Pro Sar Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr 
145 150 155 160 

Lau Val Lya Val Hia Gla Val Trp Ala Val Lya Ala Gly Gly Lau Lya 
165 170 175 

Lya Aap Sar Lau Lau Thr Val Arg Lau Gly Thr Trp Gly Hia Pro Ala 
180 185 190 

Pha Pro Sar cya Gly Arg Lau Lya Glu Asp sar Arg Tyr Xla Pha Pha 

195 200 205 

Hat Glu Pro Aap Ala Aan Sar Thr sar Arg Ala Pro Ala Ala Pha Arg 
210 215 220 

Ala sar Pha Pro Pro Lau Glu Thr Gly Arg Aan Lau Lya Lya Glu Val 
225 230 235 240 

Sar Arg val Lau cya Lya Arg Cya Ala Lau Pro Pro Gin Lau Lya Glu 
245 250 255 

Mat Lya sar Gin Glu Sar Ala Ala Gly Sar Lya Lau val Lau Arg Cya 
260 265 270 

Glu Thr Sar Sar Glu Tyr Sar Sar Lau Arg Pha Lya Trp Pha Lya Aan 

275 280 285 

Gly Aan Glu Lau Ann Arg Lya Aan Lya Pro Gin Aan Zla Lya Zla Gin 
290 295 300 

Lya Lya Pro Gly Lya Sar Glu Lau Arg Zla Aan Lya Ala Sar Lau Ala 

305 310 315 320 

Aap Sar Gly Glu Tyr Mat Cya Lya Val Zla Sar Lya Lau Gly Aan Aap 

325 330 335 

sar Ala Sar Ala Aan Zla Thr Zla Val Glu sar Aan Ala Thr sar Thr 
340 345 350 

Sar Thr Thr Gly Thr Sar Hia Leu Val Lya Cya Ala Glu Lya Glu Lya 

355 360 365 

Thr Pha Cya Val Aan Gly Gly Glu Cya Pha Mat val Lya Aap Lau Sar 

370 375 380 

Aan Pro Sar Arg Tyr Lau Cya Lya Cya Pro Aan Glu Pha Thr Gly Aap 

385 390 395 400 
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Arg Cym Gin Asa Tyr Val Mt Ma Sar 

405 

Pha Lau Sar Lau Pro Glu 
420 

What la eXaiaad las 
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Pha Tyr sar Thr Sar Thr Pr 

410 415 
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1. A Bathed of Baking a Badieaaent for tha 
traating of auscle calls of a Baaaal said Bathed 
cooprising adaixing a polypaptide encoded by p66F2HBS5 
daposltad with tha A.T.C.C. NovaBbar 6, 1992 (A.t.c.c. 
5 Deposit Mo. 75347) with a pharaacautical carrier. 

a. A aethod of aaking a aadieaaaat for tha 
traating of Busela cells of a aaaaal, said aethod 
eo^rising adaixing a polypeptide encoded by the E 
sequence (SEQ ID Nos. 133 and 159) and at least a portion 
10 of tha peptide encoded by the DHA sequences flanking the 
E encoding sequence on clone p6GF2HBS5, deposited with 
the A.T.C.C. Hoveabar 6, 1992 (A.T.C.C. Deposit No. 
75347) . 

3. A aethod of aaking a aedicaaent for the 
15 treating of msele cells of a aaaaal, said aethod 

eoavrising adaixiag a polypeptide defined by the f anmila 

YBAZCX 

trherein YBAZCX is coaposed of the polypeptide 
segaents shown in Pig. 30 (SEQ ID Nos. 133-135, 156, 

20 159); wherein y coaprises polypeptide segaent E, or is ' 
absent; wherein Z coaprises polypeptide segaent 6 or is 
absent; and irtierein X coaprises polypeptide segaents c/D 
BKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, C/0' B, 
C/D' D, C/D C/D' KKL, C/D C/D' H, C/D C/D' BL, C/D C/D' 

25 D, C/D D' B, C/D D' BL, C/D D' HKL, C/D' D' B, C/D' D' 
BL, C/D' D' BXL, C/D C/D' D' B, C/D C/D' D' BL, or C/D 
C/D' D' BKL With a pharaaeeutical carrier. 
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4. A method of making a medicament for the 
treating of muscle cells of a mammal # said method 
comprising admixing a polypeptide defined by the formula 

WBAZCX 

5 vherein HBAZCX is coa^sed of the polypeptide 

segments shown in Fig. 30 (SEQ ID Nos* 132, 134, 135, 
137-139, 156); wherein W coqprises polypeptide segment F, 
or is absent; wherein Z comprises polypeptide segment G 
or is absent; and wherein X comprises polypeptide 
10 segments C/D RKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' 

HKL, C/D' H, C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' 
HL, C/D C/D' D, C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' 
H, C/D' D' HL, C/0' D' HKL, C/D C/D' D' H, C/D C/D' D' 
HL, or C/D C/D' D' HKL with a pharmaceutical carrier. 

15 5. The method of any one of claims 1-3, %rherein 

50 N«-terminal amino acids are cleaved from said peptide 
comprising the E sequence (SEQ ID Nos. 133 and 159) • 

6. The method of claim 3 or 4, wherein X is C/D 

HKL. 

20 7. The method of claim 3 or 4, wherein X is C/D 

H. 

8. The method of claim 3 or 4, wherein X is C/D* 

HL. 

9. The method of claim 3 or 4, wherein X is C/D 

25 D. 



10. The method of claim 3 or 4, wherein X is C/D' 

KL. 
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11. Tha mthod of claia 3 or 4, wh rein X is C/0' 

HXL. 



12. The aethod o£ claim 3 or 4, wherein X is C/0« 

H. 



5 13. The Mthod o£ claia 3 or 4, vherein X is C/D' 

0. 

14. The nethod of claia 3 or 4, vherein X is C/0 
C/D' HKL. 

15. The aethod of claia 3 or 4, vherein X is C/D 

10 C/D' H. 

16. The aethod of claia 3 or 4, vherein x is c/D 

C/D' HL. 

17. The aethod of eiaia 3 or 4, vherein X is C/0 

C/D' 0. 

15 18. The aethod of claia 3 or 4, vherein X is C/D 

0' H. 

19. The aethod of claia 3 or 4, vherein X is C/D 

0' HL. 

20. The aethod of claia 3 or 4, vherein X is c/D 

20 D' ESX. 

21. The aethod of claia 3 or 4, vherein X is C/D' 

D' H. 
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22 • The aethod of claia 3 or 4, wherein X is C/D' 

23, She method of claia 3 or 4, wherein X is C/D' 

HKL. 

5 24. The aethod of claia 3 or 4, wherein X is C/D 

C/D' O' H. 

25. The aethod of claia 3 or 4» wherein X is C/D 
C/D' D' BL. 

26. The aethod of claia 3 or 4, wherein X is C/D 

10 C/D' D' 



27. A aethod of asking a aedicaaent for the 
treating of aiiscle cells of a aaaaal said aethod 
coaprising adaixing a polypeptide coaprising FBh 
polypeptide segaents having the aaino acid sequences 

15 shown in Fig. 30 (SBQ ID Nos. 132, 134, 135) with a 
pharaaceutically acceptable carrier. 

28. A aethod of aaking a aedicaaent for the 
treating of auscle cells of a aaaaal said aethod 
coaprising adaixing a polypeptide coaprising FBA' 

20 polypeptide segaents having the aaino acid sequences 
shown in Pig« 30 (SEQ ZD Nos. 132, 134, 136) with a 
pharaaceutically acceptable carrier. 



29. A aethod of asking a aedicaaent for the 
treating of auscle cells of a aaaaal said aethod 
25 coaprising adaixing a polypeptide coaprising FZBA 
polypeptide segaents having the aaino acid sequences 
shown in Fig. 30 (SEQ ZD Nos. 132, 135, 159) with a 
pharaaceutically acceptable carrier. 
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30. A neth d of making a a dlcament for th 
treating of nuscle cells of a naiBBal said method 
comprising admixing a polypeptide comprising FEBA' 
polypeptide segments having the amino acid sequences 
5 corresponding to polypeptide segments shovn in Pig. 30 
(8EQ ZD Hos. 132-134, 136, 159) to muscle cells with a 
^armaceutioally acceptable carrier. 



31. A method of making a medicament for the 
treating of muscle cells of a mammal said method 

10 comprising admixing G6F2 polypeptide with a 
^armaceutically acceptable carrier. 

32. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a compound which specifically binds 

15 the pi85^*^ receptor of muscle cells with a 
pharmaceutically acceptable carrier. 

33. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising E6PL1, 

20 having the amino acid sequence shoim Fig. 37, Seq. ZD No. 
150, with a pharmaceutically acceptable carrier. 

34. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising EGFL2, 

25 having the amino acid sequence shovn in Fig. 38, Seq. ZD 
Ho. 151, vith a pharmaceutically acceptable carrier. 
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35. A method of i&aking a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising B6FU, vith 
the amino acid sequence shown in Pig. 39^ Seg« ZD No. 

5 152, vith a pharmaceutically acceptable carrier. 

36. A method of making a medicament for the 
treating of moscle cells of a mammal said method 
comprising admixing a polypeptide comprising BGPL4, vith 
the amino acid sequence shovn in Pig. 40, Seq. ZD Mo. 

10 153, vith a pharmaceutically acceptable carrier. 



37. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising E6PI«5, vith 
the amino acid sequence shovn in Pig. 41, Seq. ZD No. 

15 154, to muscle cells, vith a pharmaceutically acceptable 
carrier. 

38. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
coiq»rising admixing a polypeptide , coq>rising E6FL6, vith 

20 the amino acid sequence shovn Pig. 42, Seq. ZD No. 155, 
vith a pharmaceutically acceptable carrier. 

39. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 35 kD polypeptide factor isolated 

25 from the rat Z-EJ transformed fibroblast cell line to 
said muscle cells, vith a pharmaceutically acceptable 
carrier. 
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40 • A Bethod of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 75 kD polypeptide factor isolated 
from the 8KBR*3 human breast cell line to said muscle 
5 cells, vith a pharmaeeutically acceptable carrier. 

41. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 44 kD polypeptide factor isolated 
from the rat I«EJ transformed fibroblast cell line to 

10 said muscle cells, vith a pharmaeeutically acceptable 
carrier. 

42. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 45 kO polypeptide factor isolated 

15 from the MDA - MB 231 human breast cell line to said 

muscle cells, vith a pharmaeeutically acceptable carrier. 

43. A method of making a medicament for the 
treating of muscle cells of a mammal said xoethod 
comprising admixing a 7 to 14 kO polypeptide factor 

20 isolated from the ATL*2 human T«-cell line to said muscle 
cells, vith a pharmaeeutically acceptable carrier. 

44. A method of making a medicament for the 
treating of musele cells of a mammal said method 
comprising admixing a 25 kO polypeptide factor isolated 

25 from activated mouse peritoneal macrophages to said 

muscle cells, vith a pharmaeeutically acceptable carrier. 
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45. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 25 kD polypeptide factor isolated 
from bovine kidney to said muscle cells, vith a 
5 phamaceutically acceptable carrier. 

46 • A method of making a medicament for the 
treating of muscle cells of a mammal said method 
GOi^ising admixing a ARZA polypeptide to said muscle 
cells, vith a pharmaceutically acceptable carrier. 

10 47. A method of making a medicament for the 

treating of muscle cells of a mammal said method 
comprising admixing a 46«*47 ko polypeptide factor which 
stimulates 0-2A glial progenitor cells to said muscle 
cells, vith a pharmaceutically acceptable carrier. 

15 48. A method of making a medicament for the 

treating of muscle cells of a mammal said method 
comprising admixing GGF^ZZI to said muscle cells, vith a 
pharmaceutically acceptable carrier. 
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49. A aathod of Baking a aedicament for tta 
treating of auscle cells of a aaaaal, said aethod 
comprising adaixing with a pharaaceutically acceptable 
carrier, a DHA segiuence encoding a polypeptide of the 
5 foraula 

YBAZCX 

wherein YBAZCX is coaposed of the polypeptide 
eegaente ehom in Fig. 30 (SEQ ID Nos. 133-135, 156, 
159) ; wherein Y coaprises polypeptide segaent E, or is 

10 absent; wherein Z coaprises polypeptide segaent 6 or is 
absent; and wherein X GcmpriBmrn polypeptide segaents C/D 
HXL, C/D B, C/D HL, C/D D, C/D' BL, C/D' BRL, C/D' B, 
C/D' D, C/D C/D' BKL, C/D C/D' B, C/D C/D' BL, C/D C/D' 
D, C/D D' B, C/D D' BL, C/D D' BKL, C/D' D' B, C/D' D' 

15 BL, C/D' D' BKL, C/D C/D' D' B, C/D C/D' D' BL, or C/0 
C/D' D' BKL, said DBA in an eiq^ressible genetic 
construction • 
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50. A method of aaking a nedicaaent for the 
treating of auscle cells of a Baomal, said aethod 
conprieing edaixiitg with a phamaceutically acceptable 
carrier, a DKA aeguence encoding a polypeptide of the 

5 f omila 

WBAZCX 

vherein HBAZCX is coaposed of the polypeptide 
aegaents shown in Pig. 30 (SEQ ZD Hoe. 132, 134, 135, 
137-139, 156); vherein W coiq^ises polypeptide segaent P, 

10 or is absent; irtierein Z eoqnrises polypeptide segaent G 
or is absent; and vherein X comprises polypeptide 
segments C/D HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D^ 
HKL, C/D' H, C/D' O, C/D C/D' HKL, C/D C/D' H, C/D C/D' 
HL, C/D C/D' D, C/D D' H, C/D D' RL, C/D D' HKL, C/D' D' 

15 B, C/D' D' BL, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' 
HL, or C/D C/D' D' HKL, said DMA in an expressible 
genetic construction. 

51. A aethod of Baking a aedicaaent for the 
prophylaxis or treataent of pathophysiological condition 

20 of the musculature in a aaaaal in vhich said condition 
involves a auscle cell type vhich is sensitive or 
responsive to a polypeptide as defined in any one of 
claims 1, 3, 4, and 31, said aethod coaq>rising adaixing 
an effective amount of said polypeptide vith a 

25 pharmaceutically acceptable carrier. 

52. A method of making a medicament for the 
treatment of a condition irtiich involves muscle damage in 
a mammal, said method comprising admixing an effective 
amount of a polypeptide, as defined in any one of claims 

30 1, 3, 4, and 31 vith a pharaceutically acceptable 
carrier. 
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53. The aethod of any one of claiois 3, 4, and 
31, wherein said aedicaaent is for decreasing t:he atrophy 
of said muscle cells* 

54. The aethod of any one of claias l, 3, 4, and 
5 31, ifherein said aedicaaent is for increasing the auscle 

fibers present in said aaaaal. 

55. The aethod of any one of claias 1, 3, 4, and 
31, wherein said aedicaaent is for increasing auscle cell 
survival in a said aaaaal. 

ID S6. The aethod of any one of claias 1, 3, 4, and 

31, wherein said aedicaaent is for increasing auscle 
growth in a said aaaaal. 

57. The aethod of any one of claias 1, 3, 4, and 
31, irtierein said aedicaaent is for increasing auscle 

15 regeneration in a said aaaaal. 

58. The aethod of any one of claias 1, 3, 4, and 
31, wherein said aedicaaent is for stiaulating auscle 
cell aitogenesis. 

59. The aethod of any one of claias l, 3, 4, and 
20 31, wherein said aedicaaent is for increasing 

acetylcholine receptor synthesis. 

60. The aethod of any one of claias 1, 3, 4, and 
31, wherein said aedicaaent is for treating a patient 
lacking a neurotrophic factor. 
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61. A aethod of clains i, 3, 4, and 31, wherein 
said Bedicanent Is for treating a muscle cell which is a 
ayoblast. 

62. A aethod of claias 1, 3, 4, and 31, wherein 
5 said aedicaaent is for treating a auscle cell %diiGh is a 

satellite cell. 

63. X aethod of claias l, 3, 4, and 31 wherein 
said aedicaaent is for treating a auscle cell in skeletal 
auscle. 

10 64. A aethod of claias i, 3, 4, and 31 wherein 

said aedicaaent is for treating a auscle cell in cardiac 
auscle. 

65. A aethod Of claias l, 3, 4, and 31 wherein 
said aedicaaent is for treating a XBUScle cell in saooth 

15 auscle. 

66. A aethod of claias i, 3, 4, and 31, wherein 
said aedicaaent is for treating a auscle cell in a 
patient with a skeletal auscle disease. 

67. A aethod of claia 66, wherein said skeletal 
20 auscle disease is a ayopathy. 

68. A aethod of claia 66, wherein said skeletal 
auscle disease is a dystro^y. 

69. A aethod of claia 68, wherein said dystrophy 
is Duchennes auscular dystrophy. 
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70. Am thod Of claim 68, wherein said dystrophy 
is Beckker's dystrophy. 

71. A method of claim 66, wherein said skeletal 
muscle disease is a result of a neural condition. 

5 72. A method of claim 66, wherein said skeletal 

muscle disease is an injury. 

73. A method of claim 66, wherein said skeletal 
muscle disease is resulting from a nerve injury. 

74. A method of claim 66, wherein said skeletal 
10 muscle disease is resulting from a neuropathy. 

75. A method of claims 1, 3, 4, and 31, wherein 
said medicament is for treating a muscle cell in a 
patient with a cardiac muscle disorder. 

76. A method of claim 75, wherein said cardiac 
15 disorder is cardiomyopathy. 

77. A method of claim 75, wherein said cardiac 
disorder is ischemic damage. 

78. A method of claim 75, wherein said cardiac 
disorder is a congenital disease. 

20 79. A method of claim 75, wherein said cardiac 

disorder is cardiac trauma. 

80. A method of claims l, 3, 4, and 31, irtierein 
said medicament is for treating a muscle cell in a 
patient with a smooth muscle disorder. 
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81. A aethod of claim 80, wherein said disorder 
is arterial sclerosis. 

82. A netbod of clain 80, wherein said disorder 
is a vascular lesion. 

5 83. A aethod of claia 80, wherein said disorder 

is a congenital vascular disease. 

84. A aethod of claias 1, 3, 4, and 31, irtierein 
said aedicaaent is for treating a ausde cell Mich has 
insufficient functional acetylcholine receptors. 

10 85. A aethod of claia 85 wherein said auscle cell 

lacking sufficient acetylcholine receptor is a muscle 
cell in a patient with i^asthenia gravis. 

86. A aethod as claimed in claia 84, wherein said 
condition involves auscular daaage. 

15 87. A aethod of aaking a aedicaaent for the 

prophylaxis or treataent of a auscular tuaor in a 
patient, said aethod coaprising adaixing an effective, 
aaount of a substance irtiich inhibits the binding of a 
factor as defined in any one of claias 1, 3, 4, and 31 to 

20 a receptor therefor with a pharaaceutically acceptable 
carrier. 

88. A aethod of aaking a aedicaaent for treating 
a nannal suffering from a disease of auscle cell 
proliferation, said method coaprising adaixing an 
25 antibody which binds to a polypeptide of any of one of 

claias l, 3, 4, and 31 with a pharaaceutically acceptable 
carrier. 
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89. A nethod of identifying a nucleic acid 
sequence coding for a molecule having auscle cell 
Bitogenic activity, said sethod coaprising contacting a 
cell containing saaQ>le with a nuscle cell aitogen 

5 specific antibody to deteraine eaqpression of said aitogen 
in said saaple and isolating said nucleic acid sequence 
froa the cells exhibiting said expression. 

90. The method of claim 31, wherein said.G6F2 is 
huaan recoabinant GGT2. 

ip 91. A aethod of stimulating ayogenesis of a 

auscle cell said aethod comprising contacting said muscle 
cell with a coispound which specifically binds the 
pl85*^^ receptor of muscle cells. 
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011f« 
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IM 
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7n 
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77» 
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1* tMI 


(SB 8 Bl Mil 


V IMS 
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(SB B Bl 


f* Ma 
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(IB B Bt 106} 
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(SB B iDi im 

(SB B 811 It3) 



BAD ORIQiNAL 



WOMOfiZM 



PCT/US94/«5M3 



17/55 BA0 0R|Q»USL 



fiont laib 



p 711/713 



p4«7/7ia 



pni/ria 



0711/713 



0771 

c: 



»4» 



3 

pTTt 



en 



p73t 



P<I8 



pTSt 



3 



a 



P4I7/71I 




P4|7/rit 



nGURE23 



BAD ORIQINAL 



wo94aaM 



PCT/US94/05083 



18/55 




suBsmruTE sheet (rule 26) 



W094M298 



PCT/US94/05083 
19/55 BAD ORIGINAL 




nCURE 25 



BAD ORIGINAL 



W09MC298 



20/SS 



FCT/USM/05083 



VHQWAAK 
II HQWAAX AMU 



xx«io ouucv 

14 s MUB tfallftV MM 



(589 D «i ii7| 



III ur«i i««««iip«rp«cf MJ80 am 



116) 

HIT) 



H lU) 



ZZM 



tVU 



ill! cuxi RMuesonNoi vim (a* o vi ubi 



AsuemYMtK (nan Mt ur> 

isa: uan «»udM«yMk vxm (119 b mi uei 



nCURE 26 



BAD ORIGINAL 



WOMOCZM FCT/IIS94/05083 

21/55 



tSifciMiaMmiHcii 



FIGURE 27 »/3 



BADORIQINAL 



FCT/US94/05083 



22/55 



AtiCifiiicttflttilCiCilff LJLJ I I lift 
• #tii«i«ii»€tli%tlf<ifi4ittcfa « tfi 
• C I < LJUl A t 4 ■ ■ I t f • i LJ^ • t • f « i t i • ^ 



ictictftiifiwifttCfmfctnnsfiurManuiciivc^ 



I'Vy^**"- M MSMM^mmMmm^^mm 

ism s m% ISO) 

nGURE27 



BAD ORIGINAL 



W094aC98 PCTABMW083 

23/55 



• ««• t m 99 all* If I 



(ificiiatiMfmHCMiiM«i«Micna<ttJiMicicicteiMr#itft«iC^r^ ^ 



•it»i>«i«««»««t«««Ai»*a««ltii«cif 

Atllifilltt|«|tt9%Cftfftf LJJI f i f • c • 

••"TTiifln'fjTriiciTmiimirrimirT — r'TrirTiTr'-'TM'rTTr""rTfrfirT'iTrTiiiiTM mi 

C » I • I C I « I l»if«l« « I tt«t%«9ttltt«C9| 

t<iMtM<tMBi4>t»<i*iigtcf WMCif iig»tt<!caMt M 1 1 rOTici tccitttctieimiriiriiOTi totiimw ill 

• til ■ > t « • A ■ I t I t t I CXJ •••t««l|t(fffff 



ticMi<i«MM«iiMnia«Nt<Mt«uMU«iinta 



ntaci<iiMttt0WMi<tiatut«ica«cnaiacfacncucttnimflc^^ m 
• I ••ic«if9««a»*tttt»«iii»i« 

cinniinfC(8ttflcc^cittii<(i«iaMttiMt«aimiAMtciMaffiM m 
umiiiMi iiiiiiiiHi H II iin'iiiH mill H wi ' in 'i' flf i iiiiiiiiiiiiiiii m 

tWMCN 



FIGURE 27 3/3 



\V0 94a629» 



PCTASM/OSOU 



24/SS 



FIG. 28 




suBsrrruTE sheet (rule 26) 



wo 9406298 



25/S5 



PCT/US94AM0g3 



: — : 




SUBSTITUTE SHEET (RULE 26) 



WOMaS2M . 25,55 ICI/US«fl»S083 

FIGURE 30 'A 
coeiM steam or «UAfc ctovm rActeviicMcvuv cot 

AcnreeecceeauicmfcecAAeTcreeccrceeece^ i« 
6cccccTcccaMcai«ecA6CGe6MceccAeeCTJUfC6ccTereccfccr^^ iit 

ceAoce^9C6caccMeA«ccAcceecc«Aanccice6^ }i0 
j>cTccaocw6cccecj^ccoc»ccnccoTccccc^^ 

<CTcc c ccc a ccccccc«ccccrccccccc c yc o cwoccc o «cTca^^ )i9 

1111 llllIU 11 Ml nil II II II III mil 
c««f M cctc«fc«e«««c«etc««tet«. .M«etcf «ffff«ff 

JuucrmcMCJa«cc6Akecx6cccTC66Accauu^cr^ «m 

UUUiUUUlU,W!ii.U Wii 

c9«9*«ee«tee«c«u««4e«ct«.c«tetee««eVH«t«tfe««««fMaafMtft 
I atataa'i io*i«fo gi»*?»A* 

jUU(CCeUCflCCMflflO CC OCJU CM 6CACCCA60CtCCCOCJUCMO C CC M^^ S4t 

I illlll tllUIIll ItllMIl illllljllll llllllll I I Hill 
•««4fc«M««ftaw9e«««a««a«4«»«e«»McteeTCC«a«M«ce9|««teefef 

4 a • » • » A 

ccTCoeeecccttoeeeM sit 

II HI HI llllllll 



oeoiM fianf ii a at 119 

ceTj^TCTTmcAfea4ecceAeeceMCAce»fC9«tf«C4<cc9«cc«ccnc«tt iit 
virrRt»iAMtt«af«ti,ff 

<leeT«ct7ecceecTe7ceAeAeee«cessMcc7cMSAAesAC4reA6ceeecTecTe i 10 

• » 

: .» : 



M/OMOOM PCT/US94rt>S083 

27/55 

eeeiM stonrt it (m d mt an 

I » I I t *| I I i « t •* 1^ A « I 6 f . 

c cn a accc c ccrrejuuacAWuacTCAccACTcwTcceaccTTc^ it 
illlllllillll lilllllllillll II lllll II I titllfllfffiiiil 

cett«eeteeec««tt«a««««««t9«««i9ee«99«atc9fet«e««fttce«««ctif 

t8c'srS*8 8T8 s b8*rtv*rfM« 

TcmeecTeeeMAeeMTTCRAATAcrecTCTCTCAAaTTeMeNem no 

I iiiiiiii II Miiiiiiitttiiiiiiiitiiii iiititiiiiinnim 

9 I t»'i tt'l l>i 1 S I* I «t't»« 

III iiiiii I lllll iiiiiiitiiiiii iiiiiiiiiit nil III II 

MMt9««ttqt«te9««a««ae«tMeae««*«t««e««9«t«e«a«MM9ce4w 



ceoiM sioart kt (mom ijs) 

t • I *b I t • t A t* t» A » *« • t r « e r 

cjumMMMTTceaimfiouuweeTCAeroQCiQATtctoc^ lo 
llllllllllllllllllll IIIIII lllllllllllillllllll lllllllllll 

V Zi*SL«l»fA*< 4«*S *tfafff' 

A6T6AtexoeMUcrMauutCAO0i«eeTcroeouaiteAeafiete«^^ lao 

Illlllllillll lllllllllllllllllllllllll IIIIIIII lllll lllll 
•Tt9ate«5iea«ttt«4«aMt«MHt«cetet9eeaat«teaee»«egtnMtcaM 



ceozM ttowf k't cm B lot lai) 

rctMj^cnekuiBkimhTnKXKXfcxtw «• 



• 



ttttBttfifiai 

cGCTTTCcrccKxicrKtfMMetekc>jkcnc9Ck^ lae 

ATTCTccACJuiTArATeTeeuMKAmeeAAAcrAcejuuTuaeTeecTCTec^ no 

I Tx vcsiieKae llrai'sosl 
ACATeAccATTCTCCAeTejuucMtAAeACATeccTACTcceTecTAmcrcAercTe a «e 

« e v' : f *c : s T • 

BAOORIQINM. 



wo 940099 



PCT/US94/0S083 



28/55 



cooxNc stooxT ai {m u »< U7> 

It ft' E TAT*fti*gi»t 

AaAtaeaeTC6aiceaficcTCJuemaA€A0ceTAi«T«T^^ «« 

iiini iiiiii iiiMiiiiiiiniii III iiiiiiiiiiiiiiiii ifiH 

It I* fit 'l 

iiiiiiiHiiiiiii iiiiiiiii iiiiiimiiiiii 

Ua«««t«te«fUtecie««c«M«fc«««t«ettette«t 

eoozM stooM CI im o i^<)* 

f t t'fti'a ft ■ >_ f I' c _A t *i 1 I f 

ccACATcoaifcnkfiMcieceACXAoeeAfc^^ it 

I II II mil II iiiiiiiiiiiiiimii 11 mil iimiiiimi 

et«e«tetM4tee«€e«etTCf«ca««ce«cettf«a««a«ft«effaf«a«9a«MM 

• • • • • » 

rcfgcaicFa w t 9 t t ■ » • • r 

iiiiiiiimiiiiiiii iiiiimiiiiiimiiijmiifiii iiiii mi 

b « 



BAOORIQiNM. 



W094aC298 PCTAJSM/05089 

nCURI 30 -H^. 

COOtiO SMnifT C/Dl W D Bi 138) 

MCTCceuccicanacTecAecacAi«TAci«ACAAi«TcccafCMA6T^ i« 

iiijiiitiiiiiiiiHiiiiiiii iiimiiiNiiimiiiiiimmtfM 

9 9 t 

I tlltlll 

■ 

OMIM SfQNBff cm O Wt 14<}) 

I e » I I r f * • » t *e 0 ■ t > «* a' • r t * 

XfcCTOCCCXAATCACrrrACTCCTCJlTCCCTtXCXUACTACOTJUWOCCJW^ M 

iiiliiiiiiitiiiiiiiiiiiiitiiitiiiiiiimiliiiuTii^^ 

fioeiM staoNT Dt (sm» xw^ 
t ff • V » i* t » *t • ' 

AOTACCTCCA C T UXVll ' CAWKIC W C CT g AAtAa 3« 

muuumuiumiiiu 

» 

ceotM stQSDir o^t <tmi »* ui) 

S I L • S t f ' ■ • 

<««MtettMfatt«Mtcutffif av. 



BAD ORIGINAL 



WOMOOM FCT/US94A)5083 

30/55 

FIGURE 30 S/e 
eooiM itonr? ii o lot \u% 

II iiiiiiiiiii iiiiiiiMiiiiiiii mil iiitiiinTnifiirr 

l, Vv'«XHCr €8*f K8'i 8 8 8* 

II mil iiiiiiiiiiiiiiiiiii iimmmiimmii mmm 

ett«t44teq«c«te«t«tft«t4«tMcc««ct«c««««ce««««4ae4«e9f«««Mf 

L 8P8t8«'f Li'f f •* fia'ifA* 

CrrCATCACCOOCTXCCCCAOJ^CCrTCCCTCTCUACJUaCACCAfC^ ic« 

II imiiii imimiiriiimiMim mm imiimi in 

et«e«t««eeftettcTCe«f««««t^^«^et«««e«««4e«at«t««tf««c«ttffee 

■ I 

leceeAAfeecccGecccACtAccTeaMCTeoTeAATCJurAecn 



II iiiii mil II II II IIIIIIIIIII II imiimimiimrii 

««t9««ectc«ee«teet««cee«cccece9««i«t«tec«9ctf9t««4tc««UefU 
• BIVXtt'lB X !V8 B8ABf'f9f* 

iimiii iiimii II iiiiiiiMiiiiiMiiii niiiiiiiiiiii 

V f iTTir'A88*f ttft«T*ftl* 

ACCMTCACtAMCTTCaiOeeSeJtTCATrCeACrACMfCACTeA^ )fO 

iiimiiiii mil iiiii Mill iiiiiiiimiii imiin 

«eet9te«ct«t»cttee«e49«ee«te«etee4ct«etgtc«eee494ctect4«ce4« 
• • • • • , 

iimimii iitiiHUHHiHjMiiMmmiiiiiii mnnii 

49«t«w«44e««4«4e4ct9««««€4teetitee94444ee4etetft44tevt94tf 



TCATcco m wjukactAOce> a ce>flccwAc»66cc^^ 4i« 

iimmiiiimiiimiiiiiiiiiii mimmi mmimi m 

te«tce9t4f«aa«e««t49«c«ca«eA9eee«4et99«9«cec«a9af9«e^ett««ft 
• ba«rii*ef s'r Lti4ii*f»e' 

III IMIIIillllllllllllllllllllMlllllllllllllllllTimTl 

TCCTACCCACACTCTCCTCATACTCAAAC 5«t RAO ORIQIIML 

I'l'I MMi: "milMllllIM 



WOMOCM 31/55 PCr/IIS»4/«SQ83 

eooxM »eii»rr (m o 15?) FIGIT 1, 30 6?/ 9 

cafciTreeccAAcTcxtmAaecncTrcamcccA 
«biAriLiAitxyg« A IF it 

tiACCeCTTCGCCmA^CAAa 

f » I f tr 0 I 

CO0IV4 SfOMBfT U (S« a Ml 149) 

ft«t«tf«c«««c«t«M«Mcee«i«ce«t«t«tc«cet«t4f«iUee«e«e«ee«af 

cTcecccAAeTCAceeeeTrcecAMTVTccceQcccctmcAOCAeaAceoxcNeM i«« 

F t I *A V f y r ft* iti*»tbiff»' 

QCCCTCeAtWCCTCA <;KVtiK i ; t lX AA6AeCAai6AC CC C WC t a TT Uta cett iiA 

III IIIMIimilM HIIIMIIIIII lllilMI II iniTllllii 
«eetteeat««e5i«te««ceeette«t««a««a«f««t9«cctet«cttete«t«aeMt 

p u y'n gg«T0a*iA«*«raifge* 

AecAeoectGccMA«AM*..fAfCAecAccAcoecofl«uiTeucr^^ «i« 

nil iiiiiiiiiiMii Mill I III I inn iiTnr mii 

«c«««99c««cg99«9c«4«t««U««ec4tc«eeete«9C«Yttea9eUettec«cet 

t f F t t 

• • • • • , 



lfl«f«tTtrA«t»Vfttf|* 

III nnitiiiniii iiiiiiii iiiTi iiiiini iiiiiiiiiiiiiiii 

^9t«t«tg«««c««ceea««««t««1*«ce««cee««9««eetftUM«««cte9ce«a 

A 

s s It'll A sttip'iroii'xAiaifK' 

cAecAecceececeecAAAACMceAACcceMTceTCACAnececACAecrrccMir 1 la 
IIIIIIIIIIIIMIIIIIMMIt-IMIII M'lMM Nil iin!; • 

■ V . 



BAD ORIGINAL 



W094M2M 32/55 FCT/USM/OSOaa 

nC RE 30 

ft I I T • A 9 f I • « t • **! t t • t I f* 

CCACMaUiaaccCCCTCAaeaCTAACrCA6Aa6CCMACA6AC«ATCAAACia 

Mill IMIIII II 1 t llllMltllMMIII inillll lllllffim 



4|« 



« ter»rLAX« ■>t,AAf liaa' 

AccACAACATACcccmecTccccATACACAAcccccfoecAcceAmrccAticeeee m 
III lltiillllltlllillM llllllllllltlllllllllllllll linil 

rAF'iLf0iit'ift*««ff 

CCCT CC Cmt CC CT C CTCCAakCCACCACTAACCCUaCCCCO CmiVKC gCMa 

iiiiiiiiiiiiini 111111111111111111 Mill iiiiiM I iiiii 

•cet9<c«tec««et«|ct«ic««e«49«ct«acc««2«M9«e9cti«te9tc«c«ffi 

IL«*AtLI«?x'AI«*0>SA?« * 

ACAATteCAOeCCMCCTCrCCGCteTAATeeCTAAeCAACAeCCrAYCeCVOTCrAAtt I4« 

nil 1 lllllllilll II IIIIIM III lllllllllllll IIIII IHII 



ccQAAAtACAceaTAaAncAceterAAAAcmAimAfAfAAfJuuMTAncaM itt 
M IMII II illlllllllllltlMMIIIJIllllMlllinilHIIlTim 



tTAAATTAAACU 911 

illMlillllll 
tta««ttt««c«« 



BAD ORIGINAL 



wo»4ac2n 



PCTAISMIIS083 



33/55 

FIGURE 30 s/S 



AtCAoijxcc^ccxjcxce^ I. 

' ' ' ■^'^rmrnnrnimmiii iri lui m i h' ... 



eCCTCTT 



"■""■irrTrrrrrrnmniM'nnrnVi m ■ ,„ :,. ,, 

>»vt,egae 



WOMOfiM PCT/US>4fl>S083 

FIGURE 31 '/A 
ecniffS ottcUotlte fctvcAct «n4 4«4tic«4 yrotaln it^tAc* 

AerrtceeckcaAcncTcecMCTcvGcec^^ i« 

C6CCCCtOckMCCCAtCCCACC6CCC0CCCCAC«^ Ijg 

TcccAccGCMcecAccaceoifieaaccACcecACeececeeccuccc^ ii« 
€eA6e69C9ceec»«cA«eA«ca€cce6CCACiMci«o^cc^ ii« 
*OTccacctG0cccccxcc6CA w r wcc t c c c c iM Ccfccc^^ )M 

< CT CCC CtlltA CCCC0C6CCC0CCW8 6C CCOqK O C WqC CC ^ ^ s«t 

MUMiTRcmjuMecMfceaiocceTcm «|« 

ceca>(8ccciccGcccMAficwocMnciccccca>cx ng 

■ • t t • ff • 

iaiaga«ti»tas«it»ffA 

aeTQQeooekaj(iMccMccmecteekoen«M^ 

vA«tibfbtetttfif0ftg 

tTCAJlCTOofTCAACMWCCJlCTCMCTAAOCCCJUUC^ 7|« 

fi«rii«tibiBiitf«iii 

tatBf«gflfclXtlA«tA0t 

0 tfSetft<tli«IOfAIAIS 

ACCAirefQCAOVeAAAfiSAttSaeCACTCCCASOCCAOC^ fO« 

qt OW T rCA flAaTCTCCa^TOeMUTCAOTATCUCAaAAC^ iff 

▼ ••itrttstvtftifffft 

TCCACAffCGACATCCACAtfCKeMeAACCCATCTTOfC^ 

t rtttfA0fitfcvg«Aiiig 

AcmciCTcicAAtccAeeeotcTQCRC^ ioa« 

TAcmTccMCfceccAJuKAmrAckeTCATeecKeeuuAm 
AccrrcTACACTAcetccACTccetTTcreTCTcrceeTSAArAeecscirsercAeTce u ee 

CTCCCSCTTT STT3 TT3 rCSCATCT rCtST 7A3A77 rCA I TTASASTTASArseBTTrri 1240 

• .V ».•• ».v* ».*. • a^.ji «?r* 

\ 

BAOOniQINAL 



WOMOfiM FCr/U8»4raS083 

35/55 



no; .IE 31 9^/^ 

tCACXTCCtCWTCCCTCAC 



TCTXTCtOMnatCmAAnACTWCATACa^ ^^^^ 

«««ACACTCeCTCTTCmAfAAAATCAWcrAT^ 
TAfcCCACTAaacXTWAAWAtCCTAAWCecr^ 
AOAATAUCAOMTAAAAAAAAMUUUUUUUUi Iltl' 



UN 

iiaf 



BAD ORIQUML 



WOMOOM FCTAJS94/05083 
i*m U M: I45« 

nGURE32 

CCnirrt fliiclMti4« tt^itACt tf«4wcH yrotalA i«^«Ac« 

«ccccjcctecjccacc«occr^ u. 
«ccttccccc«cr«AjAc^^ 

^ >it.T«f«tie »itff« 0 i« 

TACCCCATCtCA tf TC C CT C CC C Ct m Tl g rr C cCOatCTCaCTC J ^ i«« 
AOCTAOAlCCCmrACCAOCTCfAAarfcAC W CCTC W^^ llf 

cTeceTAAeecTeeAeTCTfrcreAMTTCATetTeMmeTeTCAraeeAeAMArAe loio 



BAD ORiGINAL 



wo 940098 jy^gg PCT/USMM)S083 

ism » »: HGURE 33 i/a. 

CQfiWU A«ctMtl4« to^iAC* •a4 4«4uc«4 pt9%9iH t«^«fie« 

ciucTCMJulcrTccamecAAAceefCAcicccxcAfTctc^ ^ 

KStLtXttAi LA»i«tf« ct 

■ ACTCATCAeemCTACCJlUlCAa«10CCTCtGCCMg^ ... 

ecea^aTcaatcTAaecTeccA«AA6ccATcmTCAJMT^^ 

4titit&«ffltfc?f€Afg I g 

AACriTCT6T6TCAATCGAeCC6ACTCCrfcATCCT6AAMAC^^ 

f b«IC«f«ff«AteTtffTf|"* 

aUUCTCaMCC C JU CA AAAACCCCACCACCTCTACCA C A ^ ..^ 
Kf«T«IIAgfLf«Itft,f t » 

CCCCAmtcAtCCCCCTOCTCCWCTTCCCATCA W wf sWClW W 

B g « 1 g g i gglLgggfcgg 1 g g 

aiCOtCAICAAfiCtACCciUcCflOCCCCACCACCCeA A kc C CCC W }«« 

« g g g V A ■ Of ggffrfygg f g 

0CTC6TCAATCAATAeGTAkTAMUaTeTCATCTCTAeeeM«1A1t^ «M 
LVSg TftllV X BltYggg ^ 

CQCCCAflAO CIVA iilKu ltfC»CTCACTACACtTCCACACCTCMatWCACrAClCt <4e 
AgftftfftgfftffAglgffv 

CACtCACACTCeCMTCAeMC1«eAeeAAT6CAeAeACXOJUe^ 710 
Ygt»fgSfftt«ltgttttgf 



CeeOMA0aM61CTCUfQ6CTieGCA00CCCTC6T6AA1^^ ll« 

ggigtg«t««»ti«fiffcgg 



• 



TCCCACACJUUCCCCTCACTCCTACCCAOACTCtC CTCATAgtCJUUaCATAACCTTAt »M 

AttTvotrgofrisiitvfct 

AecreAecrAAfiCAOMicAAeeeecAeAeATCcAAAtvcAicaaTc^ m« 
AtbfttiiAigt icaotgbaA 

AActcATcrrAeAGcrrmeeAnccccATTccccrrcATtcrcTMCAee^^ leio 

TNLIIASSXrilWAS ffKry«» 

rrtAqcAAccTArcTATCAecAATCAccAceceeecrccTAreTeAccTeTACAmeeA * :9io 

LC t TVSANTT rAA » * P ^ 9 t n 

CAeer:AAsc?;ct:cAAe7:A?ec5srTcscAAATe7e:7K:ec;:s7erAeeAe:Ae - ' i • 

BAOORIQINAL 



WOMOOM FCnDSM/MOU 

FIGURE 33 -i/i. 

CTMMCCeAMTMMCeXTMkncMCleTAMUtcmAtTRAtAmt^ 



BAOOnniNM. 



W0 94/U3M 



FCTAISMMS083 



39/55 

nGUR£34 



# f • • • • 

QQ£2|^0a|(CAEKEKIKVN6OeCFMVKDLSN^^ 1 

QQpftflft4K eAgKBaycvN0QDca(v>o^ 

bfiOE 6ClMCVKDFCIH*Q6aO(VKELRAPS— CKCQQEyFOeAOQEKSMCINl' 



<<saQ a ws Ha) 



vro94at2n 



40/55 




WOMOOM 



FCTAJSMra5083 



41/SS 

FIGURE 36 



r-i-4' 



r«t-&-<-</»-i 

-t-i-i 



r-i-f-A-c-VM-t . 
r«t-i-A-<-<y»*-i-* 

r«l«l*4-«/»</D'«0'-«-fc 



r-i-4-«-€-<i/»*«0 
f-»-4-C-«/»'-0'-t 



f-f- 
r»t- 
r*t 

r-t* 
f«t» 

f-f 
f»t- 
»-t« 

r«t« 
r-t 
r«t» 
f-t-i 
r«t* 
f-f- 
r-i 
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